EXPRESSION OF HYDROPHOBIC PROTEINS

Field of the Invention: 

This invention relates to the expression of non-native (ie heterologous) polypeptides
which comprise a proportion of hydrophobic amino acids in an expression system such as
a bacterial host (eg E. coli). In particular, the invention provides a method for designing
polyepitope polypeptides with an increased probability of being efficiently expressed (ie in
amounts detectable by SDS-PAGE). One particular application of the invention relates to
the production of a polyepitope polypeptide comprising cytotoxic T-cell lymphocyte (CTL)
epitopes from Epstein-Barr virus (EBV) for use in a polyepitope vaccine capable of eliciting
a CTL immune response for the prevention of diseases associated with EBV (eg infectious
mononucleosis (IM) and nasopharyngeal carcinoma (NPC)). Other particular applications
relate to the production of polyepitope polypeptides suitable for use in polyepitope
vaccines for preventing and/or treating hepatitis C virus (HCV) and human
immunodeficiency virus (HIV).



Background of the Invention: 

In order to maximise production of recombinant polypeptides in a bacterial host (eg
E. coli), a number of parameters can be considered including factors affecting transcription
(eg promoter choice, etc) and factors affecting translation mechanisms such as minimising
the use of rare codons. However, these are unlikely to have an impact on the production of
recombinant polypeptides comprising stretches of hydrophobic amino acids, which have
traditionally proven difficult to produce in recombinant bacterial expression systems.
Indeed, in the case of polypeptides comprising transmembrane sequences, the removal of
these hydrophobic sequences generally improves yields of the recombinant molecule
(Frace et al, 1999; Hobman et al, 1994; Polte et al, 1991; EMBL website-protein toxicity:
www.embl-heidelberg.de/ExternalInfo).

The most likely reason for problems occurring in the production of foreign
polypeptides possessing regions of hydrophobicity (particularly those with non-native
sequences such as fusion proteins) is the post-translational association of nascent
polypeptides with chaperone proteins such as E. coli GroEL. GroEL is involved in the
refolding process of polypeptides emerging from the ribosome and proteins will recycle
through the chaperone system until the correct conformation is achieved or the protein is
targeted for degradation. GroEL is known to bind hydrophobic amino acids and part of
the refolding process is essentially to bury these hydrophobic sequences within the interior
of the protein (Fisher and Yuan, 1994; Zahn and Pluckthun, 1994; Hayer-Hartl et al, 1994;
Richarme and Kohiyama, 1994; Hendrick and Hartl, 1995; Lin et al, 1995).

Polyepitope or "polytope™" constructs (ie polypeptides comprising a tandem array
of epitopes which may be contiguous or otherwise spaced apart by short intervening
amino acid sequences of, for example, 1 to 5 amino acids in length), would be expected to
be inherently unable to internalise any hydrophobic regions as they are not
naturally-occurring sequences and lack the folding capabilities inbuilt in naturally-occurring
proteins. Hence polypeptides which consist of non-native sequences, particularly those
[...] chaperone folding pathway and ultimately targeted for degradation if a certain degree of
conformational stability cannot be achieved.

Polyepitope vaccines typically comprise one or more polypeptides each made up of
a tandem array of CTL epitopes. These CTL epitopes, particularly those of the HLA A2
type, often comprise predominantly hydrophobic amino acids and since HLA A2 is
represented in over 40% of the human population it is mandatory that these epitopes be
included in any effective polyepitope vaccine formulation.

Examples of polyepitope vaccines are described in Australian Patent No 736336,
the entire disclosure of which is to be regarded as being incorporated herein by reference.
In this patent, vaccines are described which comprise a synthetic or recombinant
polypeptide, or a recombinant vaccinia virus or DNA vaccine encoding same, wherein the
synthetic or recombinant polypeptide typically comprises a tandem array of CTL epitopes
(eg a tandem array of 2 to 10 CTL epitopes) wherein at least two of the CTL epitopes are
contiguous or spaced apart by intervening sequences in which the intervening sequences
do not comprise any substantial lengths of naturally occurring flanking sequences of the
epitopes. Particularly described in the prior patent are vaccines comprising a polyepitope
vaccinia virus encoding a polyepitope polypeptide comprising nine CTL epitopes (each
CTL epitope being of 9 to 10 amino acids in length) from EBV. Standard chromium release
assays conducted with this virus in a panel of target cells expressing the HLA alleles for
restriction of each epitope and using autologous CTL clones specific for each epitope as
effector cells, showed that each epitope could be efficiently processed from the polyepitope
polypeptide since, in all cases, the CTL clones recognised and killed the HLA matched
target cell infected with the polyepitope vaccinia virus, but did not kill any of the negative
controls (ie TK-vaccinia).

Further examples of polyepitope vaccines are described in International patent
specification WO 01/47541, the entire disclosure of which is to be regarded as being
incorporated herein by reference. In this specification, vaccines are described which
comprise multiple HLA epitopes wherein the multiple HLA epitopes have been sorted so
as to minimise the number of "junctional epitopes" (ie epitopes inadvertently created by the
juxtaposition of two other epitopes) and wherein flanking amino acid residues are
introduced wherever junctional epitopes are unavoidable.

Polyepitope vaccines may include a large number of CTL epitopes (eg 10 or more)
so that the HLA diversity of the target population is covered. It has therefore been
contemplated by the present applicants to produce an EBV polyepitope vaccine which
includes EBV epitopes restricted by HLA A2, A3, A11, A23, A24, B7, B8, B27, B35, B44, B46,
B57, B58, B60 and B62, so as to provide protection against EBV in over 90% of the human
population. This would involve the incorporation of about 26 EBV CTL epitopes into a
polyepitope polypeptide. For the reasons given above, it was expected that such a
polypeptide would contain hydrophobic regions and that expression in a suitable host
could be highly problematical.

The work leading to the present invention was aimed at elucidating a method or
procedure for overcoming the difficulties of expressing non-native polypeptides which
comprise a proportion of hydrophobic amino acids (eg polyepitope polypeptides) in a
bacterial host such as E. coli. The present applicants have, as a result of that work,
identified a novel method for designing candidate polyepitope polypeptides with an
increased probability of being efficiently expressed in a bacterial host and/or yielding a
purified polyepitope polypeptide which is soluble in aqueous solutions. The identification
of this method arose out of a recognition of a need for individual epitopes to be arranged in
a non-random way within a polyepitope polypeptide so that regions of hydrophobicity are
distributed more evenly throughout the molecule rather than clustered in one or more
particular regions. The novel method therefore involves identifying one or more
hydrophobic peptide sequences within a polypeptide and arranging or re-locating at least
one of the hydrophobic peptide sequence(s), so as to: (a) reduce or minimise amplitude (ie
peaks) in hydrophobicity across the length of the polypeptide, and/or (b) reduce or
minimise the total length of any hydrophobic region(s) within the polypeptide. To assist in
the utility of this method, the present applicants have also identified an algorithm which
permits any number of epitopes to be readily ordered into a polyepitope polypeptide
sequence lacking or having reduced regions of relative high hydrophobicity.

Summary of the Invention: 

In a first aspect, the present invention provides a method for designing a candidate
polypeptide for expression in a prokaryotic or eukaryotic host, said method comprising,

identifying one or more hydrophobic peptide sequences within a polypeptide of
interest and

arranging or re-locating at least one of said hydrophobic peptide sequences within
said polypeptide so as to generate said candidate polypeptide with reduced amplitude in
hydrophobicity and/or length of any hydrophobic region(s).

Preferably, the polypeptide of interest is non-native to the intended host. Since the
most preferred host is E. coli, most preferably the polypeptide is non-native to E. coli.

The polypeptide of interest will preferably be a non-natural polypeptide or even a
theoretical non-natural polypeptide (ie a polypeptide yet to be synthesised or expressed)
comprising a plurality of amino acid sequences of interest some of which may be
hydrophobic or suspected to be hydrophobic, and which has been found not to be, or is
suspected not to be, efficiently expressed in said host. For such a polypeptide of interest,
the method of the first aspect provides the possibility of identifying one or more
hydrophobic peptide sequences, if any, within the polypeptide of interest and arranging or
re-locating at least one of the hydrophobic peptide sequence(s) so as to generate a
candidate polypeptide with reduced amplitude in hydrophobicity and/or length of any
hydrophobic region(s), and therefore an increased probability of being efficiently expressed
in a suitable host.

Preferably, the polypeptide of interest may be a synthesised or theoretical
polyepitope polypeptide comprising a tandem array of epitopes of interest (eg CTL
epitopes, which, as is mentioned above, often predominantly comprise hydrophobic amino
acids). For such a polypeptide of interest, the method of the first aspect permits the design
of candidate polyepitope polypeptides comprising a large number of epitopes of interest
(eg 5 to 100 or more) with an increased probability of being efficiently expressed in a
suitable host, by enabling the possibility of identifying one or more hydrophobic epitopes
and arranging or re-locating at least one of the hydrophobic epitope(s), so as to generate a
candidate polyepitope polypeptide with reduced amplitude in hydrophobicity and/or
length of any hydrophobic region(s).

The epitopes of interest may be a range of epitopes from a single pathogen (eg EBV)
selected to provide a polyepitope polypeptide that covers the HLA diversity of the target
population. The epitopes of interest may also be one or more epitopes from a range of
pathogens, or the epitopes may be derived from a non-microbial source such as a tumour
cell for treating or preventing cancer.

It has been found that the method of the first aspect is best applied to the design of
a candidate polyepitope polypeptide in a manner which identifies and ranks the relative
hydrophobicity of each of the selected epitopes, groups the ranked epitopes into three
groups of substantially equivalent numbers based upon the identified relative
hydrophobicity (ie so as to produce the groups, Group 1 = most hydrophobic, Group 2 =
middle hydrophobicity, and Group 3 = least hydrophobic together with any "residual"
epitopes where the total number of epitopes (N) is not wholly divisible by 3), and then
arranges the epitopes into triplets where each triplet contains an epitope from each
group (ie three linked epitopes; epitope 1 - epitope 2 - epitope 3), the triplets being
arranged into a candidate polyepitope polypeptide having the formula, Triplet 1 -
Triplet 2 - .... - Triplet N/3, as follows:





Table 1

                           Epitope 1                          Epitope 2                          Epitope 3
Triplet 1 (N-terminal)     Most hydrophilic of Group 2        Most hydrophobic of Group 1        Most hydrophilic of Group 3
Triplet 2                  2nd most hydrophilic of Group 2    2nd most hydrophobic of Group 1    2nd most hydrophilic of Group 3
....                       ....                               ....                               ....
Triplet N/3 (C-terminal)   Most hydrophobic of Group 2        Most hydrophilic of Group 1        Most hydrophobic of Group 3



(Any "leftover" epitope(s) (ie least hydrophilic epitopes of Group 3) may be added to the C-terminal of Triplet N/3, or
otherwise may be located within the candidate polyepitope polypeptide sequence so as to reduce any local peaks of
hydrophobicity.)

Between the epitope triplets, or between any or all of the epitopes within a triplet,
there may be intervening sequences (preferably short sequences of 1 to 10 amino acids)
which may optionally be hydrophilic (eg lysine-lysine) so as to reduce any local peaks of
hydrophobicity. Preferably, the epitopes within a triplet are contiguous.
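
The joining of triplets with optional intervening sequences can be sketched as follows. This is a minimal illustration only: the function name `join_triplets`, its parameters and the placeholder epitope strings are assumptions introduced here and do not appear in the specification. Sequences are written in one-letter amino acid code, so a lysine-lysine spacer is "KK".

```python
from typing import List


def join_triplets(triplets: List[List[str]],
                  between_triplets: str = "",
                  within_triplet: str = "") -> str:
    """Concatenate triplets from N- to C-terminal.

    `between_triplets` is an optional spacer placed between adjacent triplets;
    `within_triplet` is an optional spacer placed between epitopes of a triplet.
    Empty strings (the defaults) give contiguous epitopes.
    """
    return between_triplets.join(within_triplet.join(t) for t in triplets)


# Placeholder epitopes (hypothetical, for illustration only).
example = [["AAA", "BBB", "CCC"], ["DDD", "EEE", "FFF"]]

# Epitopes within a triplet contiguous, lysine-lysine between triplets.
spaced = join_triplets(example, between_triplets="KK")
```

With the defaults the epitopes within each triplet remain contiguous, which corresponds to the stated preference; the spacers are supplied only where a local peak of hydrophobicity is to be reduced.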

Other simple methods for arranging the epitope(s) so as to minimise extremes in
hydrophobicity in a polyepitope polypeptide will be readily apparent to persons skilled in
the art, and are to be considered as forming part of the present invention. For example, in
one variation of the above method, and referring again to Table 1, epitope 1 would instead
be selected using the stated criteria for epitope 2, epitope 2 would instead be selected using
the stated criteria for epitope 3, and epitope 3 would instead be selected using the stated
criteria for epitope 1. In another variation, the epitopes would be selected from four
groups of ranked epitopes and consequently arranged into sets of 4 epitopes.

As mentioned above, the method of the first aspect generates a candidate
polypeptide with reduced amplitude in hydrophobicity and/or length of any hydrophobic
region(s). In the context of applying the method to a natural polypeptide, "reduced
amplitude in hydrophobicity" is to be understood to mean that any peaks of
hydrophobicity of the candidate polypeptide (ie as may be calculated/measured using
Pinsoft 2 from Mimotopes Pty Ltd, Melbourne, Australia) are reduced relative to the natural
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate
polypeptide is/are reduced relative to the natural polypeptide. In the context of applying
the method to a non-natural polypeptide (including a theoretical non-natural polypeptide),
"reduced amplitude in hydrophobicity" is similarly to be understood to mean that any
peaks of hydrophobicity of the candidate polypeptide (ie as may be calculated/measured
using Pinsoft 2 from Mimotopes Pty Ltd) are reduced relative to the non-natural
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate
polypeptide is/are reduced relative to the non-natural polypeptide. In the context of
applying the method to the more specific non-natural polypeptide example of a
polyepitope polypeptide, "reduced amplitude in hydrophobicity" is to be understood to
mean that any peaks of hydrophobicity of the candidate polypeptide (ie as may be
calculated using the mathematical expression described below) are reduced relative to most
of the possible random arrangements of the epitopes comprising the polyepitope
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate
polypeptide is/are reduced relative to most of the possible random arrangements of the
epitopes within the polyepitope polypeptide.
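
One possible reading of "reduced relative to most of the possible random arrangements" is that more than half of all orderings of the same epitopes show a higher hydrophobicity peak than the candidate. The sketch below illustrates that reading under stated assumptions: the peak is taken as the largest sum of per-epitope hydrophobicity values over n consecutive epitopes, every ordering is enumerated exhaustively (practical only for a handful of epitopes; a larger set would need random sampling instead), and the function names and numerical values are hypothetical.

```python
from itertools import permutations
from typing import Sequence


def peak_hi(x: Sequence[float], n: int = 3) -> float:
    """Largest sum over any n consecutive values (requires len(x) >= n)."""
    return max(sum(x[m:m + n]) for m in range(len(x) - n + 1))


def fraction_with_higher_peak(candidate: Sequence[float], n: int = 3) -> float:
    """Fraction of all orderings whose peak is strictly higher than the candidate's."""
    target = peak_hi(candidate, n)
    peaks = [peak_hi(p, n) for p in permutations(candidate)]
    return sum(p > target for p in peaks) / len(peaks)


# Hypothetical values: two hydrophobic (1.0) and two hydrophilic (0.0) epitopes.
alternating = [1.0, 0.0, 1.0, 0.0]
clustered = [1.0, 1.0, 0.0, 0.0]

share_alternating = fraction_with_higher_peak(alternating, n=2)
share_clustered = fraction_with_higher_peak(clustered, n=2)
```

In this toy case the alternating arrangement has a lower peak than half of all orderings, while no ordering has a higher peak than the clustered arrangement.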

Once a candidate polypeptide has been designed in accordance with the method of
the first aspect, a polynucleotide encoding the candidate polypeptide may be synthesised
according to any of the methods well known to persons skilled in the art. The encoding
polynucleotide may be incorporated into, for example, vectors such as viral vectors (eg
vaccinia to provide a recombinant polyepitope viral vaccine) or expression vectors such as
those suitable for expression in a suitable host.

Thus, in a second aspect, the present invention provides a method of expressing a
polypeptide in a suitable host, said method comprising,

designing a polypeptide in accordance with the method of the first aspect,

introducing a polynucleotide encoding said polypeptide into said host, such that
said host is capable of expressing said polypeptide, and

culturing said host under conditions suitable for expression of said polypeptide.

The expressed polypeptide may be isolated by, for example, lysing the host cell and
purifying the polypeptide from the produced cell lysate.

The polynucleotide introduced into the host cell may encode the polypeptide in the
form of a fusion of the polypeptide with a suitable carrier protein. Alternatively, the
polypeptide could be expressed and subsequently linked to or otherwise associated with a
suitable carrier protein. Suitable carrier proteins are well known to persons skilled in the
art and include β-galactosidase, glutathione S-transferase and the gp350 structural protein
from EBV or a fragment thereof. The carrier protein may comprise additional useful
epitopes. The carrier protein may also confer further increases in expression beyond the
benefits provided by ordering.

In a third aspect, the present invention provides a polypeptide designed in
accordance with the method of the first aspect.

If desired, the polypeptide of the third aspect may be in the form of a fusion of the 
polypeptide with a suitable carrier protein. 

In a fourth aspect, the present invention provides a polyepitope polypeptide
designed in accordance with the method of the first aspect.

If desired, the polypeptide of the fourth aspect may be in the form of a fusion of the
polypeptide with a suitable carrier protein.

In a fifth aspect, the present invention provides a polyepitope polypeptide
comprising N epitopes, wherein N is any integer (preferably an integer in the range of 5 to
100, and more preferably, 10 to 35), said polyepitope polypeptide having the formula:

Triplet 1 - Triplet 2 - .... - Triplet N/3,

wherein each of said triplets comprises three linked epitopes selected by,

identifying and ranking the relative hydrophobicity of each of the N epitopes,

grouping the ranked N epitopes into three groups of substantially equivalent
numbers, based upon the identified relative hydrophobicity of the N epitopes, to produce a
first group (ie Group 1) comprising the most hydrophobic epitopes, a second group (ie
Group 2) comprising the epitopes having a middle level of hydrophobicity, and a third
group (ie Group 3) comprising the least hydrophobic epitopes, and

selecting the epitopes for each of said triplets according to the following table:








                           Epitope 1                          Epitope 2                          Epitope 3
Triplet 1 (N-terminal)     Most hydrophilic of Group 2        Most hydrophobic of Group 1        Most hydrophilic of Group 3
Triplet 2                  2nd most hydrophilic of Group 2    2nd most hydrophobic of Group 1    2nd most hydrophilic of Group 3
....                       ....                               ....                               ....
Triplet N/3 (C-terminal)   Most hydrophobic of Group 2        Most hydrophilic of Group 1        Most hydrophobic of Group 3



Preferably, the first, second and third groups comprise identical numbers of
epitopes. Where N is an integer not wholly divisible by 3 (ie an integer other than, for
example, 6, 9, 12, 15, and 18), then the residual epitopes are preferably included within the
third group.

At the end of the step of selecting epitopes for each of said triplets, if there is/are
any leftover epitope(s) (ie least hydrophilic epitope(s) of Group 3), then this/these may be
added to the C-terminal of Triplet N/3, or otherwise may be located within the candidate
polyepitope polypeptide sequence so as to reduce any local peaks of hydrophobicity.

Between the epitope triplets, or between any or all of the epitopes within a triplet,
there may be intervening sequences (preferably short sequences of 1 to 10 amino acids)
which may optionally be hydrophilic (eg lysine-lysine) so as to reduce any local peaks of
hydrophobicity, or otherwise avoid the creation of junctional epitopes. Preferably, the
epitopes within a triplet are contiguous.

If desired, the polyepitope polypeptide of the fifth aspect may be in the form of a
fusion of the polyepitope polypeptide with a suitable carrier protein.

In a sixth aspect, the present invention provides a polypeptide vaccine comprising a
polyepitope polypeptide according to the fourth or fifth aspect and a pharmaceutically
acceptable carrier and/or adjuvant.

In a seventh aspect, the present invention provides a polyepitope polypeptide
comprising an amino acid sequence substantially corresponding to an amino acid sequence
selected from the group consisting of:

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - VQPPQLTLQV -
GLCTLVAML - RLRAEAQVK - IEDPPFNSL - YLLEMLWRL - GQGGSPTAM -
AVLLHEESM - IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL -
QAKWRLQTL - RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW -
CLGGLLTMV - EENLLDFVRF - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW
(SEQ ID NO:1),

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - GLCTLVAML -
RLRAEAQVK - IEDPPFNSL - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW -
IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - QAKWRLQTL -
RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - CLGGLLTMV -
EENLLDFVRF - YLLEMLWRL - GQGGSPTAM - AVLLHEESM - VQPPQLTLQV
(SEQ ID NO:2),


SSCSSCPLSKI - HRCQAIRKK - CLGGLLTMV - LTAGFLIFL - RLRAEAQVK -
IEDPPFNSL - LLSAWILTA - RRRWRRLTV - PYLFWLAAI - YLLEMLWRL -
GQGGSPTAM - VMSNTLLSAW - ALLVLYSFA - RAKFKQLL - IALYLQQNW -
TYGPVFMCL - QAKWRLQTL - YLQQNWWTL - YPLHEQHGM - CPLSKTLL
(SEQ ID NO:3),

IPIVAIVALV - RLRPGGKKK - ILKEPVHGV - PLVKLWYQL - RPGGKKKYKL -
KYKLKHIVW - TWETWWTEYW - EEKDTKEAL - KRWIILGLNK -
KLWVTVYYGV - KIEELRQHL - MTNNPPIPV - VTLWQRPLV - WASRELERF -
LLWKGEGAV - YTAFTIPSI - IYQEPFKNLK - SLYNTVATL - AIIRILQQL -
AIFQSSMTK - VTYQYMDDL - LVGPTPVNI - TPQDLNTML - YLAWVPAHK -
ALVEICTEM - TLNAWVKW (SEQ ID NO:4),



and

LLFNTLGGWV - KTSERSQPR - FLLLADARV - LLFLLLADA - RLGVRATRK -
GVAGALVAFK - LPGCSFSIF - RMYVGGVEHR - VAGALVAFK - DLMGYIPLV -
LIFCHSKKK - ILAGYGAGV - HMWNFISGI - QLFTFSPRR - VGIYLLPNR -
FWAKHMWNF - YLVTRHADV - LSAFSLHSY - WMNRLIAFA - YLLPRRGPRL -
YLVAYQATV - RLIVFPDLGV - TLGFGAYMSK - EPFYGKAI - VLVGGVLAA -
CTCGSSDLY (SEQ ID NO:5).

In an eighth aspect, the present invention provides a polyepitope vaccine
comprising a polyepitope polypeptide according to the seventh aspect and a
pharmaceutically acceptable carrier and/or adjuvant.

Detailed Description of the Invention: 

The present applicants have identified novel methods for designing a candidate
polyepitope polypeptide with an increased probability of being efficiently expressed in a
prokaryotic or eukaryotic host (ie in amounts detectable by SDS-PAGE). The method
involves identifying one or more hydrophobic epitope(s) and arranging or re-locating at
least one of the hydrophobic epitope(s) so as to generate a candidate polyepitope
polypeptide with reduced amplitude in hydrophobicity and/or length of any hydrophobic
region(s).

An algorithm to calculate hydrophobicity values of amino acid sequences and
subsequently arrange sequences to: (a) reduce or minimise amplitude in hydrophobicity,
and/or (b) reduce or minimise the length of hydrophobic sequences, was generated and
initially applied to 26 CTL epitope sequences from EBV. This resulted in the design of two
initial candidate polyepitope polypeptides (designated PT26A and PT26B, described
hereinafter), one of which proved to be efficiently expressed in E. coli. The expressed
polyepitope polypeptide shows promise as the basis of an EBV vaccine for prevention or
treatment of infectious mononucleosis and/or EBV-related cancers such as Burkitt's
lymphoma, Hodgkin's disease, non-Hodgkin's lymphoma, nasopharyngeal carcinoma,
gastric adenocarcinoma, lymphomas associated with immunosuppression,
lymphoepithelioma-like carcinomas, and immunodeficiency-related leiomyosarcoma.

While looking for an explanation as to why the expression capabilities of E. coli for
the two similar candidate polyepitope polypeptides were different, summations of
hydrophobicity values (designated Hydrophobic Index (HI) values) were calculated for
different numbers of epitopes over the length of the candidate polyepitope polypeptides to
identify local areas of hydrophobicity. Summation over 3 and 4 epitopes showed that there
were regions in the non-expressed polypeptide where the HI value was higher than in the
expressed polypeptide. This information enabled the identification of a threshold HI
value, such that polypeptide sequences which comprised a region with an HI value in
excess of the threshold value could be predicted as being less likely to be efficiently
expressed in a bacterial host.

Thus, in a preferred embodiment of the methods of the present invention for
designing candidate polypeptides, the methods involve initially calculating hydrophobicity
values and arranging peptide sequences to: (a) reduce or minimise amplitude in
hydrophobicity, and (b) reduce or minimise the length of hydrophobic sequences; and then
"fine-tuning", if necessary, by calculating the HI values over different peptide sequence
groups, thus providing numerical values for comparison and prediction of the likelihood of
a candidate polypeptide sequence being efficiently expressed in a bacterial host. So, in
applying this preferred embodiment to the design of a candidate polyepitope polypeptide,
the method involves:

(i) Calculating the hydrophobic value for each epitope using a suitable algorithm (eg
Fauschere and Pliska, 1983 contained within the software package "Pinsoft 2" from
Mimotopes Pty Ltd).

(ii) Ranking the set of epitopes in order of decreasing hydrophobicity.

(iii) Dividing the rank ordered set of epitopes into a number of equal groups (eg three
equal groups wherein group 1 = most hydrophobic, group 2 = middle hydrophobicity and
group 3 = least hydrophobic (most hydrophilic)), and including any residual epitopes in
the most hydrophilic group.

(iv) Creating sets (eg triplets) of epitopes by taking, in for example a case where the
epitopes have been divided into three groups, the most hydrophilic epitope of group 2 (ie
last in group 2), then the most hydrophobic epitope of group 1 (ie number 1 in group 1) and
lastly the most hydrophilic epitope of group 3 (ie last in group 3), and repeating with the
next epitopes in each group until all epitopes in groups 1 and 2 have been used (nb
"leftover" epitopes are handled as set out in step (ix) below).

(v) Arranging the sets of epitopes (eg triplets) into a sequence in the order in which
they were produced (eg Triplet 1 - Triplet 2 - Triplet 3 - etc).

(vi) Plotting the hydrophobicity of the arranged polyepitope polypeptide sequence
using a suitable algorithm (eg Fauschere and Pliska, 1983, or Hopp and Woods, 1981).

(vii) If necessary, reducing hydrophobic amplitude by re-locating sets of epitopes (eg
triplets) from areas of low hydrophobicity into areas of high peak hydrophobicity and/or
by re-locating individual hydrophobic (ie group 1) epitopes from areas of peak
hydrophobicity into areas of low hydrophobicity.

(viii) Re-calculating the hydrophobicity plots and continuing, if necessary, to shuffle sets
of epitopes (eg triplets) as in step (vii) above to generate a final sequence arrangement.

(ix) Placing any leftover epitopes (eg least hydrophilic epitopes of group 3) at the
C-terminal of the final sequence arrangement or other location so as to further reduce local
peaks in hydrophobicity (ie by inserting them adjacent to epitopes of peak
hydrophobicity).

(x) Placing any affinity tags (usually hydrophilic, eg a hexa-histidine sequence) at
either the N- or C-terminal of the final polyepitope polypeptide sequence or at the
C-terminal if the final polyepitope polypeptide sequence is to be expressed as a fusion
protein.
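
Steps (ii) to (v), together with the simplest option of step (ix), can be sketched in code as follows. This is an illustrative sketch and not the applicants' implementation: it assumes the per-epitope hydrophobicity values of step (i) have already been computed (the numbers below are invented, not Pinsoft 2 output), it uses three groups, and it places leftover epitopes at the C-terminal only. The names `order_epitopes` and `assemble` are hypothetical.

```python
from typing import Dict, List, Tuple


def order_epitopes(hydrophobicity: Dict[str, float]) -> Tuple[List[List[str]], List[str]]:
    """Return (triplets, leftovers) for epitopes mapped to hydrophobicity values.

    Higher values are taken to mean more hydrophobic.
    """
    # Step (ii): rank in order of decreasing hydrophobicity.
    ranked = sorted(hydrophobicity, key=hydrophobicity.get, reverse=True)

    # Step (iii): three groups; residual epitopes join the most hydrophilic group.
    size = len(ranked) // 3
    group1 = ranked[:size]           # most hydrophobic
    group2 = ranked[size:2 * size]   # middle hydrophobicity
    group3 = ranked[2 * size:]       # least hydrophobic, plus any residuals

    # Step (iv): triplet k takes the k-th most hydrophilic of group 2, the k-th
    # most hydrophobic of group 1, then the k-th most hydrophilic of group 3.
    triplets = [[group2[-1 - k], group1[k], group3[-1 - k]] for k in range(size)]

    # Unused members of group 3 are its least hydrophilic epitopes (see step (ix)).
    leftovers = group3[:len(group3) - size]
    return triplets, leftovers


def assemble(triplets: List[List[str]], leftovers: List[str]) -> List[str]:
    """Step (v), with leftovers appended at the C-terminal (one option of step (ix))."""
    return [epitope for triplet in triplets for epitope in triplet] + leftovers


# Invented values for seven placeholder epitopes E1 (most hydrophobic) to E7.
values = {"E1": 0.9, "E2": 0.8, "E3": 0.6, "E4": 0.5, "E5": 0.3, "E6": 0.2, "E7": 0.1}
triplets, leftovers = order_epitopes(values)
ordered = assemble(triplets, leftovers)
```

For the seven placeholder epitopes this gives the triplets E4 - E1 - E7 and E3 - E2 - E6, with E5 left over. Steps (vi) to (viii) are manual fine-tuning against a hydrophobicity plot and are not attempted in this sketch.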

The HI values may be calculated by using the mathematical expression: 
$$\mathrm{HI}_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x_e = hydrophobicity
value of epitope e).

Preferably, the HI values are calculated using this mathematical expression when
n=3 and n=4. In the examples provided hereinafter, this calculation predicted that to be
able to express linked, random, short amino acid sequences in E. coli in SDS-PAGE
detectable amounts, the hydrophobic index over groups of three epitope sequences would
need to be less than 1.8 (HI 3 < 1.8) and/or that over groups of four epitope sequences, the
hydrophobic index would need to be less than 2.5 when x was calculated using Pinsoft 2
(Mimotopes Pty Ltd) and specifying the N-terminus as N-acetyl and the C-terminus as
carboxy amide. Different cut-off values will be obtained with different hydrophobicity
algorithms.
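
The HI calculation and the threshold comparison can be illustrated with a short sketch. It makes several assumptions: the per-epitope values x are supplied already computed (the numbers below are invented for illustration and are not Pinsoft 2 output), windows are indexed from 0 rather than from 1, and both cut-offs are required to hold together, which is the stricter reading of "and/or". The function names are hypothetical.

```python
from typing import List, Sequence


def hi_values(x: Sequence[float], n: int) -> List[float]:
    """HI for each window of n consecutive epitopes: the sum of x[m] .. x[m+n-1]."""
    return [sum(x[m:m + n]) for m in range(len(x) - n + 1)]


def passes_thresholds(x: Sequence[float],
                      hi3_max: float = 1.8,
                      hi4_max: float = 2.5) -> bool:
    """True if every 3-epitope window is below hi3_max and every 4-epitope
    window is below hi4_max (defaults taken from the cut-offs quoted in the text)."""
    return (all(h < hi3_max for h in hi_values(x, 3))
            and all(h < hi4_max for h in hi_values(x, 4)))


# Invented per-epitope values, in N- to C-terminal order.
clustered = [0.5, 1.0, 0.25, 0.75, 0.5]
spread = [0.5, 0.25, 0.5, 0.25, 0.5]

hi3_clustered = hi_values(clustered, 3)       # [1.75, 2.0, 1.5]
clustered_ok = passes_thresholds(clustered)   # False: one window reaches 2.0
spread_ok = passes_thresholds(spread)         # True
```

As the text notes, the cut-offs are tied to the hydrophobicity algorithm used, so the default values would need to be re-derived if x were calculated with a different scale.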

It will be readily appreciated that the calculation of HI values in this manner would
be useful for predicting whether a natural, non-bacterial polypeptide or a derivative
thereof may be efficiently expressed in E. coli. The present invention therefore further
extends to a method of predicting efficient expression of a polypeptide in a suitable host
(eg a bacterial host), involving calculating HI values in accordance with the mathematical
expression:

$$\mathrm{HI}_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x_e = hydrophobicity
value of epitope e).

The methods of the present invention permit the design of candidate polyepitope
polypeptides comprising a large number of epitopes of interest (eg 5 to 100 or more) with
an increased probability of being efficiently expressed in a suitable host (eg a bacterial
host). The epitopes of interest may be a range of epitopes from a single pathogen selected
to provide a polyepitope polypeptide that covers the HLA diversity of the target
population, or the epitopes of interest may be one or more epitopes from a range of
pathogens or tumour antigens. As is evident from the above, one particular application of
the methods of the present invention is to the design of candidate polyepitope
polypeptides comprising 26 EBV CTL epitopes for use in a vaccine to provide protection
against EBV in over 90% of the human population, or for treating diseases associated with
EBV such as NPC. Other particular applications of the methods of the present invention
relate to the design of candidate polyepitope polypeptides suitable for use in polyepitope
vaccines for preventing and/or treating HCV and HIV. Another particular application of
the methods of the present invention is to the design of candidate polyepitope
polypeptides comprising CTL epitopes from cytomegalovirus (CMV), for use in a vaccine
to prevent or treat CMV-causative diseases.

A candidate polypeptide designed in accordance with the methods of the present
invention may be expressed by firstly synthesising a polynucleotide encoding the
candidate polypeptide according to any of the methods well known to persons skilled in
the art, and then by introducing the polynucleotide into a suitable host. Typically, this will
be achieved by cloning the polynucleotide into an expression vector and then introducing
the expression vector into said host by any of the transformation methods well known to
persons skilled in the art. Expression from the expression vector may result in the
polypeptide being expressed as a fusion protein comprising the polypeptide and a suitable
carrier protein (eg β-galactosidase, glutathione S-transferase or the gp350 structural protein
from EBV or a fragment thereof). Alternatively, the polypeptide may be expressed by the
host cell, and following isolation of the polypeptide, the polypeptide may be linked to or
otherwise associated with a suitable carrier protein. The carrier protein may also confer
additional useful properties (ie the carrier protein may comprise useful epitopes or
sequences to enhance solubility, further enhance purification procedures, facilitate
association with an adjuvant or to which an immune response is desirable).

It is further contemplated that candidate polypeptides designed in accordance with
the methods of the present invention may be expressed in prokaryotic expression systems
other than E. coli. Typical alternative systems include B. subtilis, Salmonella sp.,
Streptococcus sp., Lactobacillus sp., and Streptomyces sp.

It is also contemplated that candidate polypeptides designed in accordance with the
methods of the present invention may be readily expressed in whole cell lysates and
non-bacterial host cells as well, and accordingly such alternative expression methods for
candidate polypeptides are to be considered as forming part of the present invention. In
particular, the present invention is to be considered as extending to a method of expressing
a polypeptide in a non-bacterial host cell such as a mammalian cell (eg a CHO cell or COS
cell line), a yeast cell (eg Saccharomyces cerevisiae) or insect cell (eg SF9 cell line), wherein
the method comprises designing a polypeptide in accordance with the method of the first
aspect, introducing a polynucleotide encoding the polypeptide into the host cell such that
the host cell is capable of expressing the polypeptide, and culturing the host cell under
conditions suitable for expression of the polypeptide. The expressed polypeptide may be
isolated from the host cell culture by lysing the cells and purifying the polypeptide from
the resulting cell lysate, or alternatively, the polypeptide could be expressed with a
suitable secretion signal such that the polypeptide is secreted into the culture medium
(from where it may be purified). Designing a polyepitope polypeptide in accordance with
the methods of the present invention may also overcome non-secretion problems which are
sometimes experienced when a hydrophobic polypeptide is expressed with a foreign
secretion signal.

Where the expressed polypeptide is of pharmacological or veterinary significance,
the polypeptide may be formulated into a pharmaceutical or veterinary composition.
Generally, such compositions will comprise a pharmaceutically acceptable or veterinary
acceptable carrier, and may include other substances and excipients as may be required.

Polyepitope polypeptides may be formulated into vaccine compositions. Generally,
such compositions will comprise a pharmaceutically acceptable or veterinary acceptable
carrier and may include adjuvants (eg an ISCOM™ adjuvant, DEAE, polysaccharides,
saponins, liposomes and virus-like particles), and other substances and excipients as may
be required. The vaccine compositions may also include helper epitopes/CD4 epitopes or
B-cell epitopes. The vaccine compositions may be adapted for administration to a subject
by, for example, intramuscular injection, nasal administration via an aerosol spray, or oral
administration. Preferably, the vaccine compositions are ISCOM™ adjuvant compositions.

Polyepitope polypeptides may also be administered to a subject in the form of a
viral vaccine (eg a recombinant polyepitope vaccinia or adenovirus) or DNA vaccine.
Thus, in a further aspect, the present invention provides a polynucleotide vaccine
comprising a polynucleotide encoding a polypeptide designed in accordance with the
method of the first aspect, and a pharmaceutically acceptable carrier and/or adjuvant.

Throughout this specification the word "comprise", or variations such as
"comprises" or "comprising", will be understood to imply the inclusion of a stated element,
integer or step, or group of elements, integers or steps, but not the exclusion of any other
element, integer or step, or group of elements, integers or steps.

The term "substantially corresponding" as used herein in relation to an amino acid
sequence is intended to encompass the exact amino acid sequence as well as minor
variations which do not result in a substantial decrease in the biological activity of the
amino acid sequence (eg variations which do not diminish the ability of an epitope to
provoke a CTL immune response). These variations may include one or more conservative
amino acid substitutions. The conservative amino acid substitutions envisaged are: G, A,
V, I, L, M; D, E, N, Q; S, C, T; K, R, H; and P, Nα-alkylamino acids.

Any discussion of documents, acts, materials, devices, articles or the like which has
been included in the present specification is solely for the purpose of providing a context
for the present invention. It is not, therefore, to be taken as an admission that any or all of
these matters form part of the prior art base or were common general knowledge in the art
relevant to the present invention as it existed in Australia or elsewhere before the filing or
priority date of the present specification.

The invention will hereinafter be further described by way of the following
non-limiting examples and accompanying figures.

Brief Description of the accompanying Figures:

Figure 1 provides the epitope configuration and amino acid sequences for EBV
polyepitope polypeptides, PT26A and PT26B. Numbers above epitopes represent
hydrophobicity values for each epitope calculated with Pinsoft 2 software, specifying an
acetyl N-terminus and an amide C-terminus.

Figure 2 provides hydrophobicity plots of PT26A and PT26B. Hydrophobicity
values of a moving nine amino acid window are derived using the algorithm of Fauchere
and Pliska, 1983.

Figure 3 provides the amino acid sequence of the fusion of residues 21 to 447 of EBV
gp350 to PT26A (A) or to PT26B (B).

Figure 4 shows Coomassie stained SDS-PAGE gels showing the time course of
expression following induction with IPTG (+IPTG or I) of: (A) PT26A (at approximately
30kDa), (B) gp350/PT26A (at approximately 80kDa), and (C) PT26B. Arrows
indicate the location of recombinant protein.

Figure 5 provides ELISPOT assay results of CTL responses to the five HLA A2
epitopes contained within the polyepitope polypeptides, PT26A and PT26B, under two
formulation conditions: (A) 10 µg gp350-PT26A, (B) 10 µg gp350-PT26B, and (C) positive
control peptide mix containing a mixture of each of the 5 A2 epitopes contained in the
polyepitope polypeptide. Each of the epitopes is represented below by its first 3 amino
acids. The CTL response of each mouse (M1-M5) to each A2 epitope is presented as a bar
indicating the number of IFN-γ spots produced.

Figure 6 provides the epitope configuration and amino acid sequences for an EBV
polyepitope polypeptide, EBV-NPCa. Numbers above epitopes represent hydrophobicity
values for each epitope calculated with Pinsoft 2 software, specifying an acetyl N-terminus
and an amide C-terminus.

Figure 7 provides the epitope configuration and amino acid sequences for HIV
polyepitope polypeptides, HIVa and HIVb. Numbers above epitopes represent
hydrophobicity values for each epitope calculated with Pinsoft 2 software, specifying an
acetyl N-terminus and an amide C-terminus. Hydrophobic Index values across 3 epitopes
(n = 3) are shown below.

Figure 8 provides the epitope configuration and amino acid sequences for HCV
polyepitope polypeptides, HCVa and HCVb. Numbers above epitopes represent
hydrophobicity values for each epitope calculated with Pinsoft 2 software, specifying an
acetyl N-terminus and an amide C-terminus. Hydrophobic Index values across 3 epitopes
(n = 3) are shown below.

Figures 9 and 10 show Coomassie stained SDS-PAGE gels and also immunoblots
from an expression time course of constructs HIVa, HIVb, HCVa and HCVb. Figure 9 -
Panel A: 1 hour timepoint. Lane 1) Novex SeeBlue+2 MW markers; 2) Negative vector/host
control; 3) HIVa uninduced; 4) HIVa induced; 5) HIVb uninduced; 6) HIVb induced; 7)
HCVa uninduced; 8) HCVa induced; 9) HCVb uninduced; 10) HCVb induced. Panel B: 2
hour timepoint. Lane 1) Negative vector/host control; 2) Novex SeeBlue+2 MW markers; 3)
HIVa uninduced; 4) HIVa induced; 5) HIVb uninduced; 6) HIVb induced; 7) HCVa
uninduced; 8) HCVa induced; 9) HCVb uninduced. Figure 10 - Panel A: 3 hour timepoint.
Lane 1) Negative vector/host control; 2) HIVa uninduced; 3) HIVa induced; 4) HIVb
uninduced; 5) HIVb induced; 6) Novex SeeBlue+2 MW markers; 7) HCVa uninduced; 8)
HCVa induced; 9) HCVb uninduced; 10) HCVb induced. Panel B: Overnight timepoint.
Lane 1) Negative vector/host control; 2) HIVa uninduced; 3) HIVa induced; 4) HIVb
uninduced; 5) HIVb induced; 6) HCVa uninduced; 7) HCVa induced; 8) HCVb uninduced;
9) HCVb induced; 10) Novex SeeBlue+2 MW markers.

Example 1: EBV polyepitope fusions as vaccine candidates. 

MATERIALS AND METHODS 

Epitope sequences 

The 26 CTL epitopes for inclusion in an EBV vaccine, the proteins from which they
originate and HLA type are shown in Table 1.



TABLE 1. CTL epitopes included in the EBV polytopes

HLA Type    EBV Protein    Epitope
A2          LMP2           CLGGLLTMV (SEQ ID NO:6)
A2          BMLF1          GLCTLVAML (SEQ ID NO:7)
A2          EBNA6          LLDFVRFMGV (SEQ ID NO:8)
A2          LMP1           YLLEMLWRL (SEQ ID NO:9)
A2          LMP1           YLQQNWWTL (SEQ ID NO:10)
A3          EBNA3          RLRAEAQVK (SEQ ID NO:11)
A11         LMP2           SSCSSCPLSKI (SEQ ID NO:12)
A23         LMP2           PYLFWLAAI (SEQ ID NO:13)
A24         LMP2A          TYGPVFMCL (SEQ ID NO:14)
A24         EBNA4          TYSAGIVQI (SEQ ID NO:15)
B7          EBNA3          RPPIFIRRL (SEQ ID NO:16)
B8          EBNA3          FLRGRAYGL (SEQ ID NO:17)
B8          EBNA3          QAKWRLQTL (SEQ ID NO:18)
B8          BZLF1          RAKFKQLL (SEQ ID NO:19)
B27         EBNA4          HRCQAIRKK (SEQ ID NO:20)
B27         EBNA6          RRIYDLIEL (SEQ ID NO:21)
B35         EBNA4          AVLLHEESM (SEQ ID NO:22)
B35         EBNA3          YPLHEQHGM (SEQ ID NO:23)
B44         EBNA6          EENLLDFVRF (SEQ ID NO:24)
B44         EBNA6          [epitope sequence illegible in source] (SEQ ID NO:25)
B44         EBNA4          VEITPYKPTW (SEQ ID NO:26)
B46         EBNA3          VQPPQLTLQV (SEQ ID NO:27)
B57         LMP1           IALYLQQNW (SEQ ID NO:28)
B58         EBNA4          VSFIEFVGW (SEQ ID NO:29)
B60         LMP2           IEDPPFNSL (SEQ ID NO:30)
B62         EBNA4          GQGGSPTAM (SEQ ID NO:31)

Design/Ordering of epitopes

The following method was used to generate ordered arrangements of CTL epitopes
to produce a polyepitope sequence with favourable hydrophobicity characteristics:

(i) The hydrophobicity value for each epitope was calculated using a suitable algorithm
(ie Pinsoft 2 from Mimotopes Pty Ltd, specifying the N-terminus as N-acetyl and the
C-terminus as carboxy amide).

(ii) The set of epitopes was ranked in order of decreasing hydrophobicity.

(iii) The rank ordered set of epitopes was divided into 3 equal groups (ie group 1 = most
hydrophobic, group 2 = middle hydrophobicity and group 3 = least hydrophobic (most
hydrophilic)). Residual epitopes (ie 2 epitopes left over after the set of 26 was divided by
3) were included in the most hydrophilic group.

(iv) Triplets of epitopes were created by taking the most hydrophilic of group 2 (ie last
in group 2), then the most hydrophobic epitope (ie number 1 in group 1) and lastly the
most hydrophilic (ie last in group 3). This was continued until all epitopes in groups 1 and
2 had been used (nb "leftover" epitopes were added to the C-terminal end of the final
sequence arrangement).

(v) The triplets were then arranged into a sequence in the order in which they were
produced (ie Triplet 1 - Triplet 2 - Triplet 3 - etc).

(vi) The hydrophobicity of this triplet arrangement was then plotted using a suitable
algorithm (ie Fauchere and Pliska).

(vii) If and where necessary, triplets were relocated from areas of low hydrophobicity into
areas of high peak hydrophobicity in order to reduce hydrophobic amplitude.

(viii) The hydrophobicity plots were recalculated and, if necessary, triplets were shuffled
further as in step (vii) above.

(ix) Any residual epitopes (ie least hydrophilic of group 3) can be placed C-terminally
in the final sequence arrangement or can be used to further reduce local peaks in
hydrophobicity by inserting them adjacent to epitopes of peak hydrophobicity, according
to a hydrophobicity plot of the assembled triplets.

(x) Any affinity tags (usually hydrophilic, eg a hexa-histidine sequence) should be
located either N- or C-terminally, preferably C-terminally if the construct is a fusion
protein.

(xi) Confirmation that the HI values are satisfactory.

Using this process, EBV polyepitope configurations PT26A and PT26B were created.
The example of PT26A is shown in Table 4.
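
Steps (ii) to (v) above can be sketched in Python as follows. This is a minimal illustration only, not the procedure actually used: the function name `order_epitopes`, the epitope labels and the hydrophobicity values are hypothetical placeholders (real values would come from an algorithm such as Pinsoft 2), and the plot-guided shuffling of steps (vi) to (ix) is omitted because it relies on inspection of the hydrophobicity plot.

```python
def order_epitopes(hydrophobicity):
    """Arrange epitopes into triplets following steps (ii) to (v).

    hydrophobicity: dict mapping an epitope label to its hydrophobicity
    value (higher = more hydrophobic). The values are assumed inputs.
    Returns (triplets, leftovers).
    """
    # Step (ii): rank in order of decreasing hydrophobicity.
    ranked = sorted(hydrophobicity, key=hydrophobicity.get, reverse=True)

    # Step (iii): three groups; residual epitopes join the most
    # hydrophilic group (group 3).
    size = len(ranked) // 3
    group1 = ranked[:size]
    group2 = ranked[size:2 * size]
    group3 = ranked[2 * size:]

    # Step (iv): last of group 2, then first of group 1, then last of
    # group 3, repeated until groups 1 and 2 are used up.
    triplets = []
    for _ in range(size):
        triplets.append((group2.pop(), group1.pop(0), group3.pop()))

    # Step (v): triplets stay in the order produced; whatever remains in
    # group 3 is the "leftover" set destined for the C-terminal end.
    return triplets, group3


# Hypothetical labels and values, for illustration only.
values = {"E1": 0.9, "E2": 0.8, "E3": 0.6, "E4": 0.5,
          "E5": 0.3, "E6": 0.2, "E7": 0.1, "E8": 0.0}
triplets, leftovers = order_epitopes(values)
print(triplets)
print(leftovers)
```

With 26 epitopes this sketch yields eight triplets and two leftover epitopes, which matches the residual count given in step (iii).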

Hydrophobic Index (HI) calculations

HI values for favourable configurations (PT26A, PT26B) were calculated according
to the mathematical expression:

$$HI_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x = epitope
hydrophobicity value) over values of n from 2 to 5.
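
The summation can be illustrated with a short Python sketch. The function names and the hydrophobicity values below are hypothetical and chosen only to make the arithmetic easy to follow; they are not Pinsoft 2 values for any real epitope. Python lists are 0-indexed, so list position 0 corresponds to group m = 1.

```python
def hydrophobic_index(x, n):
    """Return the HI value of every group of n consecutive epitopes.

    x: epitope hydrophobicity values in sequence order (assumed inputs).
    n: group size.
    Element i of the result is x[i] + ... + x[i + n - 1], ie HI for
    group number m = i + 1.
    """
    return [sum(x[m:m + n]) for m in range(len(x) - n + 1)]


def max_hi_by_group_size(x, sizes=range(2, 6)):
    """Highest HI value for each group size n (n = 2 to 5 by default)."""
    return {n: max(hydrophobic_index(x, n)) for n in sizes}


# Hypothetical hydrophobicity values for a five-epitope arrangement.
x = [0.5, 0.75, 0.25, 1.0, 0.5]
print(hydrophobic_index(x, 3))
print(max_hi_by_group_size(x))
```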

Preparation of recombinant proteins

DNA sequences encoding the polyepitopes (PT26A and PT26B) were generated
from synthetic oligonucleotides using a Splicing by Overlap Extension technique (SOE) as
described by Horton et al (1995). The codon usage was optimised for E. coli expression
(Wada et al 1992). The polyepitopes were tagged at the C-terminus with a hexa-histidine
tag for protein purification and detection. The DNA was subcloned into pET28b
(Novagen) and transformed into E. coli BL21(DE3) cells (Novagen) for expression.

A fragment corresponding to the N-terminal region (amino acid residues 21 - 447)
of EBV gp350 was amplified from plasmid DNA containing the full length gp350 sequence
by PCR using the following oligonucleotides:

5' AGGGATCCCATGGAAGATCCTGGTnTTTC 3' (forward) (SEQ ID NO:32) and
5' TCTAGAGGTCGACACCTGTCGTTGTATTGGG 3' (reverse) (SEQ ID NO:33). This
DNA fragment was subcloned into pET28b (Novagen) containing the polyepitope insert,
resulting in an in-frame fusion between gp350 and the polyepitope polypeptide. The
constructs were referred to as gp350/PT26A (Figure 3A) and gp350/PT26B (Figure 3B).

For protein expression testing, transformed cells were grown in L broth containing
50mg/ml kanamycin at 37°C until the OD600 reached 2. Protein expression was
induced by the addition of IPTG (0.25mM) and cells were grown for another 3 hours. Cells
were pelleted and boiled in SDS sample buffer before analysis by SDS-PAGE.

For protein purification, cells induced by IPTG were pelleted, resuspended in
binding buffer (20mM Tris-HCl pH 7.9, 0.5M NaCl, 5mM imidazole) and then sonicated.
Inclusion bodies were pelleted and washed in buffer. The proteins were solubilised
overnight in binding buffer containing 8M urea and purified on a Ni²⁺-NTA column.

Preparation of ISCOM™ formulations

ISCOMATRIX™-adjuvant was prepared by combining adjuvant components in a
formulation vessel. Cholesterol, 1,2-dipalmitoyl phosphatidylcholine (DPPC), and
ISCOPREP™ as a source of purified Quillaja saponins, were mixed in a weight ratio of 1:1:5
in the presence of the detergent Mega-10 (United States Patent No. 5,679,354) at a
concentration of 2%. The detergent was removed by diafiltration with PBS and the
formation of ISCOMATRIX™ confirmed by negative contrast electron microscopy
revealing complexes including cage-like structures typically with a diameter of 40nm.
ISCOM™-adjuvanted vaccines were prepared by mixing the EBV polyepitope antigen with
preformed ISCOMATRIX™-adjuvant, as described below. The dose
strength of ISCOM-adjuvant as saponin was quantified by reverse phase HPLC assay.

ISCOM™ vaccines were prepared by gentle mixing at 22°C of an equal volume of
2x final dose strength ISCOMATRIX™ with an equal volume of 2x final dose strength EBV
polyepitope antigen (gp350-PT26A and B). After 60 minutes, the formulation was
subjected to extensive dialysis, in order to remove urea, at 4°C into PBS buffer pH6.2 using
12,000 molecular weight cut off dialysis membrane (Cellu Sep T3, San Antonio, Texas).

Mouse immunogenicity
Dosing
Female HLA A2 transgenic C57Bl/6 mice (HHD) were bred at Queensland Institute
of Medical Research (QIMR, Brisbane, Australia) and immunised at 5-7 weeks of age. Mice
were housed in filter-topped cages in the PC3 animal facility at QIMR. Groups of 4 or 5
mice were dosed sub-cutaneously at the tail base with 0.1ml formulation. This was
followed by removal of spleens at day 21 for ex vivo ELISPOT assay (below).

Mice dosed sub-cutaneously received 10 µg ISCOM™-adjuvant (as saponin) and
10 µg EBV polyepitope antigen. For a vaccine control group, mice were dosed
sub-cutaneously with a peptide mixture comprising free peptides (Mimotopes Pty Ltd)
formulated with tetanus toxoid and Montanide ISA 720 (SEPPIC, Paris, France) as
previously described (Elliot et al, 1999). Peptide control immunisations come from two
groups of mice, one group immunised with a GLCTLVAML (SEQ ID
NO:7)/YLLEMLWRL (SEQ ID NO:9)/LLDFVRFMGV (SEQ ID NO:8) peptide mixture and
the other with YLQ/CLG. A mixture of all five epitopes showed some insolubility problems.

CTL activity (ex vivo ELISPOT)

Ex vivo ELISPOT measures/quantitates both effector and memory CTL which
secrete IFNγ. Peptide-specific IFNγ secreting cells are enumerated by an enzyme linked
immuno-spot (ELISpot) assay modified from Murali-Krishna et al. Flat bottomed 96-well
microtitre plates are coated overnight with 5 µg/ml of rat anti-mouse IFNγ antibody (clone
RA-6A2, BD PharMingen, San Diego, California, USA). Coated plates are then blocked for
1 hour with 1% FBS in PBS, and then washed three times with PBS/0.05% Tween 20
(PBS-T), and incubated for 1 hour at 37°C with medium comprising RPMI 1640, supplemented
with 100 µg/ml streptomycin and 100 IU/ml penicillin, 10% FBS and 10⁻⁵ M
2-mercaptoethanol. Mouse splenocytes are then treated with red blood cell lysis buffer,
washed and resuspended to 1 x 10⁷ cells/ml in medium, for use in the ex vivo IFNγ
ELISPOT assay. Splenocytes (1 x 10⁶/well) are then placed in the first wells of the ELISpot
plate and serially diluted two fold. Recombinant human IL-2 (kindly provided by Cetus
Corp., Emeryville, California, USA) is then added to the plate at a final concentration of 5
IU/well together with EBV peptide (Mimotopes Pty Ltd) at a final concentration of 100
µg/ml. Media containing IL-2 without peptide is added to control wells. The final volume
in each well is 100 µl. Plates are incubated at 37°C in 5% CO₂ for approximately 18 hours.
After incubation, cells are lysed by rinsing the plates in H₂O and then washed twice in
PBS-T. Biotinylated anti-mouse IFNγ antibody clone XMG1.2 (BD PharMingen) is diluted 1:500



WO 2004/007556    PCT/AU2003/000910

(2 µg/ml final concentration) in PBS-T/5% FBS and added to all wells at 50 µl/well and
incubated for 2 hours at RT. Plates are then washed in PBS-T and streptavidin-alkaline
phosphatase, diluted 1:400 in PBS-T/5% FBS, is added at 50 µl per well and incubated for a
further 2 hours. After washing, plates are developed by adding Sigma Fast BCIP/NBT
substrate at 50 µl/well. Plates are incubated at 37°C for approximately 20 minutes to allow
colour development and then washed with water to stop the reaction. IFNγ specific spots
are counted using a KS ELISPOT Reader (Zeiss).

RESULTS

Epitope Fusions

26 EBV CTL epitopes were selected to provide >90% human population coverage
for a vaccine formulation.

In order to link these CTL epitopes (Table 1) together and facilitate the design of a
polyepitope antigen to form the basis of an EBV prophylactic vaccine, the hydrophobicity
value of each epitope was calculated using Pinsoft 2 software (Table 2). The
26 epitopes were then ordered into two configurations, PT26A and PT26B (Figure 1), which
reduced peak hydrophobicity and hydrophobic sequence length (Figure 2). When these
constructs were cloned for expression in E. coli it was found that only one of the
configurations (PT26A) was able to produce a polyepitope polypeptide (Figure 4A). PT26B
was not produced (Figure 4C).

To identify a potential reason for this unexpected finding, local areas of high
hydrophobicity were examined by summation of overlapping hydrophobicity values
(Pinsoft 2, Mimotopes Pty Ltd) to provide a hydrophobicity index (HI) over varying
numbers of peptides in a group (n). For n=2, no correlation was apparent. However, for
n=3 and n=4 (Table 3), the highest HI values for the expressed sequence (PT26A) were
lower (1.79 and 2.51 respectively) than the highest values obtained for the non-expressed
sequence, PT26B (2.02 and 2.54 respectively). For n=5, again no significant differences were seen.
This would indicate that there were local areas of slightly higher hydrophobicity in PT26B
than PT26A.

The above analysis can be represented by the mathematical expression:

$$HI_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x = epitope
hydrophobicity value).

Overall, the results with n=3 and n=4 appeared to show the most distinctive
differences. This leads to the prediction that, in order to express linked, random,
short amino acid sequences in E. coli in SDS-PAGE detectable amounts, the hydrophobic
index over groups of three epitope sequences would preferably need to be less than 1.8 (HI3
< 1.8) and/or that over groups of four the hydrophobic index would preferably need to be
less than 2.5 (HI4 < 2.5), where x was calculated using Pinsoft 2 (Mimotopes Pty Ltd) and
specifying the N-terminus as N-acetyl and the C-terminus as carboxy amide. Different
cut-off values will be obtained with different hydrophobicity algorithms.
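
The preferred cut-offs can be applied to a candidate arrangement along the following lines. This is a sketch under stated assumptions: the function and constant names are illustrative, the input values are hypothetical, and the cut-offs 1.8 and 2.5 are meaningful only for hydrophobicity values calculated with Pinsoft 2 as specified above. Because the text links the two conditions with "and/or", the sketch reports them separately rather than combining them.

```python
HI3_CUTOFF = 1.8  # preferred upper limit over groups of three (Pinsoft 2 scale)
HI4_CUTOFF = 2.5  # preferred upper limit over groups of four (Pinsoft 2 scale)


def window_sums(x, n):
    """HI values for every group of n consecutive epitopes."""
    return [sum(x[m:m + n]) for m in range(len(x) - n + 1)]


def check_cutoffs(x):
    """Report the highest HI3 and HI4 values and whether each is below
    its preferred cut-off. x holds epitope hydrophobicity values in
    sequence order and must contain at least four entries."""
    max_hi3 = max(window_sums(x, 3))
    max_hi4 = max(window_sums(x, 4))
    return {
        "max_HI3": max_hi3,
        "max_HI4": max_hi4,
        "HI3_ok": max_hi3 < HI3_CUTOFF,
        "HI4_ok": max_hi4 < HI4_CUTOFF,
    }


# Hypothetical arrangement, for illustration only.
print(check_cutoffs([0.5, 0.75, 0.25, 0.5, 0.25]))
```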

Table 4 shows the HI values over 3 peptides (n=3) for 15 random arrangements of
the 26 EBV CTL epitopes that were generated and analysed by calculating the HI. This
shows that all 15 random configurations contain multiple HI values in the
range predicted to preclude the production of a recombinant polypeptide in E. coli (ie HI
greater than 1.8). This also shows that the arrangements made in accordance with the
present methods are unlikely to be arrived at without application of the insights embodied
in the present invention.
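
A comparison of this kind could be reproduced along the following lines. The sketch is illustrative only: the function names, the seed and the hydrophobicity values are hypothetical, and the output is not the data of Table 4. It shuffles a set of values a fixed number of times and records the highest HI (n=3) of each random arrangement.

```python
import random


def max_hi(x, n=3):
    """Highest HI value over all groups of n consecutive epitopes."""
    return max(sum(x[m:m + n]) for m in range(len(x) - n + 1))


def random_arrangement_maxima(values, count=15, n=3, seed=0):
    """Shuffle the epitope values `count` times and return the highest
    HI value found in each random arrangement. The seed only makes the
    illustration repeatable."""
    rng = random.Random(seed)
    maxima = []
    for _ in range(count):
        shuffled = list(values)
        rng.shuffle(shuffled)
        maxima.append(max_hi(shuffled, n))
    return maxima


# Hypothetical hydrophobicity values, not those of the EBV epitopes.
values = [1.0, 0.875, 0.75, 0.5, 0.375, 0.25, 0.125, 0.0, -0.25]
for peak in random_arrangement_maxima(values):
    print(peak)
```

Each peak can then be compared against the cut-off appropriate to the hydrophobicity algorithm in use.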
Fusions of these polyepitopes with the N-terminal 400 amino acids of a naturally-occurring
EBV protein (gp350).

Provision of CD4 help has previously been shown to improve CTL induction (Thuy
et al, 2001) and the EBV structural protein gp350 was identified as the preferred candidate
to provide this property because it would provide cognate help. Hence the two
polyepitopes, PT26A and PT26B, the latter of which was unable to be expressed in E. coli,
were cloned onto the C-terminus of the N-terminal 400 amino acids of gp350 (Figure 3) and
expressed as the fusion protein in E. coli. Both of these fused polypeptides were
well-produced, being clearly visible on a Coomassie-stained gel above the background of E. coli
proteins (profile for PT26A shown in Figure 4B).

CTL activity

Vaccination of HLA A2 transgenic mice with either gp350-PT26A-ISCOM™ vaccine
or gp350-PT26B-ISCOM™ vaccine induced a CTL immune response (IFNγ ELISPOT) to all
five A2 epitopes (Figure 5), thus indicating that all A2 epitopes were properly processed
and presented to the immune system.

Example 2: EBV polyepitope fusions as nasopharyngeal carcinoma vaccine
candidates.

MATERIALS AND METHODS
Epitope sequences

The 20 CTL epitopes for inclusion in an EBV-NPC vaccine, the proteins from which
they originate and HLA type are shown in Table 5.

Design/Ordering of epitopes
An ordered arrangement of CTL epitopes to produce a polyepitope sequence with
favourable hydrophobicity characteristics was generated by the method described in
Example 1 (i) - (xi).

The final shuffle involved taking YPLHEQHGM (SEQ ID NO:23) from position 3
and exchanging it with CLGGLLTMV (SEQ ID NO:6) at position 19 to reduce a high
hydrophobic index (HI) which resulted from summing epitopes 18, 19 and 20.
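
The effect of such an exchange on the HI can be illustrated as follows. The values and positions in this sketch are hypothetical and far shorter than the 20-epitope EBV-NPCa arrangement; the helper names are illustrative. Positions are 1-based, as in the text.

```python
def max_hi(x, n=3):
    """Highest HI value over all groups of n consecutive epitopes."""
    return max(sum(x[m:m + n]) for m in range(len(x) - n + 1))


def swap_positions(x, i, j):
    """Return a copy of x with the epitopes at 1-based positions i and j
    exchanged; the input list is left unchanged."""
    y = list(x)
    y[i - 1], y[j - 1] = y[j - 1], y[i - 1]
    return y


# Hypothetical arrangement with a hydrophobic cluster at positions 4 to 6.
before = [0.25, 0.0, 0.5, 1.0, 0.75, 0.75]
# Move the hydrophilic epitope at position 2 into the cluster.
after = swap_positions(before, 2, 5)

print(max_hi(before), max_hi(after))
```

In this illustration the exchange lowers the peak HI (n=3) from 2.5 to 2.25.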

The ordering process for this optimised EBV-NPC polyepitope (EBV-NPCa)
configuration is shown in Table 6.

Hydrophobic Index (HI) calculations

HI values for EBV-NPCa were calculated according to the mathematical expression:

$$HI_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x = epitope
hydrophobicity value) over values of n from 2 to 5.

Generation of DNA constructs

The amino acid sequence of the EBV-NPC polyepitope was back translated to DNA
using the Dnastar Editseq software (DNASTAR Inc, Madison, Wisconsin, USA) and codons
optimised for E. coli expression. A C-terminal hexa-histidine tag was incorporated for
purification and detection. The DNA encoding the polyepitope was generated from
synthetic oligonucleotides using a Splicing by Overlap Extension technique (SOE) as
described by Horton et al (1995). This was cloned into pET24b (Novagen) and the resulting
construct sequenced to ensure that no errors were present (Big Dye Terminator Kit V3.1;
Applied Biosystems). For expression purposes, DNA was transformed into E. coli
BL21(DE3) cells (Novagen).

Protein Expression and Analysis

Transformed cells were grown at 37°C in Terrific Broth containing 50mg/ml
kanamycin. At an OD600 of ~2, protein expression was induced by addition of 0.5mM
IPTG and samples taken at 1 hour, 2 hours, 3 hours and overnight post-induction. Cells
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to
analysis by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti
hexa-histidine monoclonal antibody.
TABLE 5: CTL epitopes included in the EBV-NPCa polyepitope

HLA Type       EBV Protein    Epitope
A11            LMP2           SSCSSCPLSKI (SEQ ID NO:12)
A23, A24       LMP2           PYLFWLAAI (SEQ ID NO:13)
A24            LMP2           TYGPVFMCL (SEQ ID NO:14)
A25            LMP2           VMSNTLLSAW (SEQ ID NO:36)
A2.1           LMP2           CLGGLLTMV (SEQ ID NO:6)
A2.3           LMP2           LLSAWILTA (SEQ ID NO:37)
A2.6           LMP2           LTAGFLIFL (SEQ ID NO:38)
B8             LMP2           CPLSKILL (SEQ ID NO:39)
B27            LMP2           RRRWKRLTV (SEQ ID NO:40)
B40            LMP2           IEDPPFNSL (SEQ ID NO:30)
A2, A68, A69   LMP1           YLLEMLWRL (SEQ ID NO:9)
A2             LMP1           YLQQNWWTL (SEQ ID NO:10)
A2             LMP1           ALLVLYSFA (SEQ ID NO:41)
B57, B58       LMP1           IALYLQQNW (SEQ ID NO:28)
A3             EBNA3          RLRAEAQVK (SEQ ID NO:11)
B8             EBNA3          QAKWRLQTL (SEQ ID NO:18)
B35            EBNA3          YPLHEQHGM (SEQ ID NO:23)
B27            EBNA4          HRCQAIRKK (SEQ ID NO:20)
B62            EBNA4          GQGGSPTAM (SEQ ID NO:31)
B8             BZLF1          RAKFKQLL (SEQ ID NO:19)
Example 3: HIV polyepitope fusions as vaccine candidates.

MATERIALS AND METHODS
Epitope sequences

The 26 CTL epitopes for inclusion in a HIV vaccine, the proteins from which they
originate and HLA type are shown in Table 7.

Design/Ordering of epitopes

An ordered arrangement of CTL epitopes to produce a polyepitope sequence with
favourable hydrophobicity characteristics was generated by the method described in
Example 1 (i) - (xi). At the same time, five random sequences were generated and one of
these (HIVb) was taken for comparison with the optimised sequence.

The ordering process for the optimised HIV polyepitope (HIVa) configuration is
shown in Table 8.

Hydrophobic Index (HI) calculations

HI values for both configurations (HIVa and HIVb) were calculated according to
the mathematical expression:

$$HI_m = \sum_{e=m}^{m+n-1} x_e$$

(where m = group number, n = group size, e = integer (epitope) number, x = epitope
hydrophobicity value) over values of n from 2 to 5.

Generation of DNA constructs

Each polyepitope amino acid sequence was back translated to DNA using the
Dnastar Editseq software and codons optimised for E. coli expression. C-terminal
hexa-histidine tags were incorporated for purification and detection. The DNA encoding the
polyepitopes was generated from synthetic oligonucleotides using a Splicing by Overlap
Extension technique (SOE) as described by Horton et al (1995). These were cloned into
pET24b (Novagen) and the resulting constructs sequenced to ensure that no errors were
present (Big Dye Terminator Kit V3.1; Applied Biosystems). For expression purposes,
DNA was transformed into E. coli BL21(DE3) cells (Novagen).
Protein Expression and Analysis

Transformed cells were grown at 37°C in Terrific Broth containing 50mg/ml
kanamycin. At an OD600 of ~2, protein expression was induced by addition of 0.5mM
IPTG and samples taken at 1 hour, 2 hours, 3 hours and overnight post-induction. Cells
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to
analysis by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti
hexa-histidine monoclonal antibody.

RESULTS

Epitope fusions

26 HIV CTL epitopes were selected to provide the components of a vaccine
formulation. In order to link these CTL epitopes (Table 7) together and facilitate the design
of a polyepitope antigen to form the basis of a HIV vaccine, the hydrophobicity value of
each epitope was calculated (as in the case of Example 1) using Pinsoft 2 software. Two
versions of the 26 epitopes were then created (Figure 7), one which reduced peak
hydrophobicity and hydrophobic sequence length (HIVa) and another in which the
epitopes were randomly arranged (HIVb). These constructs were then assembled and
cloned into E. coli for a comparison of their ability to produce a polyepitope polypeptide.

HIVa (optimised polyepitope) 

Good expression of polyepitope polypeptide at the predicted MW of 29kDa was 
observed on Coomassie stained gels from the 1 hour time point and the protein was 
recognised by an anti hexa-histidine monoclonal antibody on the immunoblot (Figures 9 
and 10). Maximum expression was reached by 1 hour post-induction. Leaky, uninduced 
expression was seen at each time point, gradually increasing to induced levels after 
overnight incubation. A potential dimer was visible only by immunoblot after overnight 
induction, with some higher molecular weight material also being detected at 3 hours and 
overnight post-induction timepoints. 

HIVb (random polyepitope) 

No polyepitope polypeptide was detected on the stained gel or by immunoblotting 
(Figures 9 & 10). 

DISCUSSION 

Use of the above algorithm to order the 26 CTL epitopes has allowed the expression 
of an HIV polyepitope polypeptide in good yields. On the other hand, expression of a 
polyepitope sequence which was randomly ordered failed to yield any detectable product. 

TABLE 7: CTL epitopes included in the HIV polytopes 



HLA Type            HIV Protein              Epitope
A3                  p17                      RLRPGGKKK (SEQ ID NO: 42)
A*2402,A24,A23      p17                      KYKLKHIVW (SEQ ID NO: 43)
B35                 p17                      WASRELERF (SEQ ID NO: 44)
B7,B8               p17                      RPGGKKKYKL (SEQ ID NO: 45)
A2,A*0202,B62       p17                      SLYNTVATL (SEQ ID NO: 46)
B8,B60              p17                      EIKDTKEAL (SEQ ID NO: 47)
A2,A*0202           p24                      TLNAWVKVV (SEQ ID NO: 48)
B7,B42,B53          p24                      TPQDLNTML (SEQ ID NO: 49)
A2                  p24                      MTNNPPIPV (SEQ ID NO: 50)
B27                 p24                      KRWIILGLNK (SEQ ID NO: 51)
A28,A74             Protease                 VTLWQRPLV (SEQ ID NO: 52)
A2,A*0201           Protease                 LVGPTPVNI (SEQ ID NO: 53)
A2,A*0201           Reverse Transcriptase    ALVEICTEM (SEQ ID NO: 54)
A2,B51              Reverse Transcriptase    YTAFTIPSI (SEQ ID NO: 55)
A3,A11              Reverse Transcriptase    AIFQSSMTK (SEQ ID NO: 56)
A2,A*0201           Reverse Transcriptase    VIYQYMDDL (SEQ ID NO: 57)
A2,HoV              Reverse Transcriptase    KIEELRQHL (SEQ ID NO: 58)
A2                  Reverse Transcriptase    ILKEPVHGV (SEQ ID NO: 59)
A11                 Reverse Transcriptase    IYQEPFKNLK (SEQ ID NO: 60)
B44                 Reverse Transcriptase    TWETWWTEYW (SEQ ID NO: 61)
A2                  Reverse Transcriptase    PLVKLWYQL (SEQ ID NO: 62)
B7,A3               Reverse Transcriptase    YLAWVPAHK (SEQ ID NO: 63)
A2,A*0201           Integrase                LLWKGEGAV (SEQ ID NO: 64)
A2,A*0201           Vpr                      AIIRILQQL (SEQ ID NO: 65)
B7                  Vpu                      IPIVAIVALV (SEQ ID NO: 66)
A2,A2.1             gp160                    KLWVTVYYGV (SEQ ID NO: 67)

Example 4: HCV polyepitope fusions as vaccine candidates. 

MATERIALS AND METHODS 
Epitope sequences 

The 26 CTL epitopes for inclusion in a HCV vaccine, the proteins from which they 
originate and HLA type are shown in Table 9. 

Design /Ordering of epitopes 

An ordered arrangement of CTL epitopes to produce a polyepitope sequence with 
favourable hydrophobicity characteristics was generated by the method described in 
Example 1 (i) - (xi). The final shuffle involved taking CTCGSSDLY (SEQ ID NO:68) from 
position 3 and exchanging it with FLLLADARV (SEQ ID NO:69) at position 26 to reduce a high 
hydrophobic index (HI) which resulted from summing epitopes 24, 25 and 26. At the same 
time, five random sequences were generated and one of these (HCVb) was taken for 
comparison with the optimised sequence. 

The ordering process for the optimised HCV polyepitope (HCVa) configuration is 
shown in Table 10. 

Hydrophobic Index (HI) calculations 

HI values for both configurations (HCVa and HCVb) were calculated according to 
the mathematical expression: 

HI = \sum_{e=m}^{m+n-1} x_e 

(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydrophobicity value), over values of n from 2 to 5. 
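
As an illustration only, the windowed sum defined by the expression above can be sketched in Python. The function names (hydrophobic_index, hi_profile) and the numerical values are hypothetical; the per-epitope hydrophobicity values used in the Examples were obtained with the Pinsoft 2 software and are not reproduced here.

```python
from typing import Dict, List, Sequence


def hydrophobic_index(x: Sequence[float], m: int, n: int) -> float:
    """Sum the hydrophobicity values of the n consecutive epitopes
    starting at 1-based position m, i.e. HI = x_m + ... + x_(m+n-1)."""
    if m < 1 or n < 1 or m + n - 1 > len(x):
        raise ValueError("group falls outside the epitope list")
    return sum(x[m - 1 : m + n - 1])


def hi_profile(
    x: Sequence[float], sizes: Sequence[int] = (2, 3, 4, 5)
) -> Dict[int, List[float]]:
    """HI for every group position m, for each group size n (2 to 5 by default)."""
    return {
        n: [hydrophobic_index(x, m, n) for m in range(1, len(x) - n + 2)]
        for n in sizes
        if n <= len(x)
    }


# Hypothetical per-epitope hydrophobicity values, listed in polyepitope order.
example_values = [1.0, -0.5, 2.0, 0.5, -1.0, 1.5]
profile = hi_profile(example_values)
peak = max(max(values) for values in profile.values())
print(profile[3])  # HI of each group of three consecutive epitopes
print(peak)        # highest HI over all group sizes considered
```

Under these assumptions, two candidate orderings of the same epitopes could be compared by their peak HI, which is the kind of comparison that motivated the final exchange of positions 3 and 26 described above.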

Generation of DNA constructs 

Each polyepitope amino acid sequence was back translated to DNA using the 
Dnastar Editseq software and codons optimised for E coli expression. C-terminal hexa- 
histidine tags were incorporated for purification and detection. The DNA encoding the 
polyepitope sequences was generated from synthetic oligonucleotides using a Splicing by 
Overlap Extension technique (SOE) as described by Horton et al (1995). These were cloned 
into pET24b (Novagen) and the resulting constructs sequenced to ensure that no errors 






were present (Big Dye Terminator Kit V3.1; Applied Biosystems). For expression purposes, 
DNA was transformed into E coli BL21(DE3) cells (Novagen). 
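
The back-translation step described above can be sketched as follows. This is a minimal illustration, not the procedure performed by the Dnastar Editseq software: the codon table is an assumed choice of one commonly used E coli codon per amino acid, the function names are hypothetical, and start/stop codons and cloning sites are deliberately omitted.

```python
# Assumed codon choices: one commonly used E. coli codon per amino acid.
# The table actually applied in the Examples is not given in the text.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}

HIS_TAG = "HHHHHH"  # C-terminal hexa-histidine tag (SEQ ID NO: 35)


def back_translate(protein: str) -> str:
    """Map each one-letter amino acid code to its assumed preferred codon."""
    return "".join(PREFERRED_CODON[aa] for aa in protein.upper())


def build_coding_sequence(epitopes) -> str:
    """Join the epitopes in order, append the His tag, and back-translate."""
    return back_translate("".join(epitopes) + HIS_TAG)


# Two epitopes from the HCV epitope table, used purely as sample input.
dna = build_coding_sequence(["FLLLADARV", "CTCGSSDLY"])
print(len(dna))  # three nucleotides per residue, tag included
```

A real design would also need to check the resulting DNA for unwanted restriction sites and repeats before splitting it into overlapping oligonucleotides for SOE assembly.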

Protein Expression and Analysis 

Transformed cells were grown at 37°C in Terrific Broth containing 50mg/ml 
Kanamycin. At an OD600 of ~2, protein expression was induced by addition of 0.5mM 
IPTG and samples taken at 1 hour, 2 hours, 3 hours and overnight post-induction. Cells 
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to 
analysing by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by 
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti 
hexa-histidine monoclonal antibody. 

RESULTS 

Epitope fusions 

26 HCV CTL epitopes were selected to provide the components of a vaccine 
formulation. In order to link these CTL epitopes (Table 9) together and facilitate the design 
of a polyepitope polypeptide to form the basis of a HCV vaccine, the hydrophobicity value 
of each epitope was calculated (as in the case of Example 1) using Pinsoft 2 software. Two 
versions of the 26 epitopes were then created (Figure 8), one which reduced peak 
hydrophobicity and hydrophobic sequence length (HCVa) and another in which the 
epitopes were randomly arranged (HCVb). These constructs were then assembled and 
cloned into E coli for a comparison of their ability to produce a polyepitope polypeptide. 

HCVa (optimised polyepitope) 

While induced monomeric polyepitope polypeptide was not visible on Coomassie 
stained gels, protein of the predicted MW of 27.5kDa was detected at 2 and 3 hours post- 
induction when immunoblots were probed with the anti hexa-histidine antibody (Figures 9 
and 10). Over time there was an increasing presence of high molecular weight products as 
a smear, indicating protein aggregation. After overnight induction these high molecular 
weight aggregates were also visible on the Coomassie stained gel (Figure 10). 

HCVb (random polyepitope) 

There was no detection of any induced monomeric polyepitope polypeptide either 
on Coomassie stained gels or by immunoblotting (Figures 9 and 10). Two faint bands were 
detected by immunoblotting after overnight induction; however, these were at ~24 and 
30kDa and were clearly not the predicted product of MW 27.5kDa. They most likely 
corresponded to two histidine-rich proteins of E coli, rotamase (~23kDa; NCBI Accession 
NP_417808) and a protein of unknown function (~32kDa; NCBI Accession BAA15973). 
Since the former is a peptidyl-prolyl-isomerase involved in protein folding, it is perhaps 
not surprising that this should be overproduced in this situation. 

DISCUSSION 

In designing a HCV polyepitope, use of the above algorithm to order the 26 CTL 
epitopes has allowed the polypeptide sequence to be expressed, mainly as an aggregated 
product. On the other hand, expression of the polyepitope sequence which was randomly 
ordered failed to yield any detectable product. 

TABLE 9: CTL epitopes included in the HCV polytopes 



HLA Type    HCV Protein    Epitope
A2          NS1/E2         FLLLADARV (SEQ ID NO: 69)
A2          NS4            YLVAYQATV (SEQ ID NO: 70)
A2          NS5            RLIVFPDLGV (SEQ ID NO: 71)
A2          CORE           DLMGYIPLV (SEQ ID NO: 72)
A2          NS4            WMNRLIAFA (SEQ ID NO: 73)
A2          NS4            VLVGGVLAA (SEQ ID NO: 74)
A2          NS4            HMWNFISGI (SEQ ID NO: 75)
A2          NS4            ILAGYGAGV (SEQ ID NO: 76)
A2          CORE           YLLPRRGPRL (SEQ ID NO: 77)
A2          NS1/E2         LLFLLLADA (SEQ ID NO: 78)
A2          NS3            YLVTRHADV (SEQ ID NO: 79)
A3          CORE           KTSERSQPR (SEQ ID NO: 80)
A3          CORE           RLGVRATRK (SEQ ID NO: 81)
A3          ENV            QLFTFSPRR (SEQ ID NO: 82)
A3          NS1/E2         RMYVGGVEHR (SEQ ID NO: 83)
A3          NS3            LIFCHSKKK (SEQ ID NO: 84)
A3          NS4            GVAGALVAFK (SEQ ID NO: 85)
A3          NS4            VAGALVAFK (SEQ ID NO: 86)
A3          NS3            TLGFGAYMSK (SEQ ID NO: 87)
B7          CORE           LPGCSFSIF (SEQ ID NO: 88)
A1          NS5            LSAFSLHSY (SEQ ID NO: 89)
A1          NS3            CTCGSSDLY (SEQ ID NO: 68)
A24         NS4B           FWAKHMWNF (SEQ ID NO: 90)
A31         NS5            VGIYLLPNR (SEQ ID NO: 91)
A2          NS4            LLFNILGGWV (SEQ ID NO: 92)
B7          NS3            IPFYGKAI (SEQ ID NO: 93)

It will be appreciated by persons skilled in the art that numerous variations and/or 
modifications may be made to the invention as shown in the specific embodiments without 
departing from the spirit or scope of the invention as broadly described. The present 
embodiments are, therefore, to be considered in all respects as illustrative and not 
restrictive. 

Claims: 

1. A method for designing a candidate polypeptide for expression in a suitable host, 
said method comprising, 

identifying one or more hydrophobic peptide sequences within a polypeptide of 
interest, and 

arranging or re-locating at least one of said hydrophobic peptide sequences within 
said polypeptide so as to generate said candidate polypeptide with reduced amplitude in 
hydrophobicity and/or length of any hydrophobic region(s). 

2. The method of claim 1, wherein the polypeptide of interest is to be expressed in a 
bacterial host. 

3. The method of claim 1 or 2, wherein the bacterial host is E. coli. 

4. The method of claim 3, wherein the polypeptide is non-native to E. coli. 

5. The method of any one of the preceding claims, wherein the polypeptide of interest 
comprises a non-natural polypeptide or a theoretical non-natural polypeptide. 

6. The method of claim 5, wherein the polypeptide comprises a polyepitope 
polypeptide comprising a tandem array of epitopes of interest. 

7. A method for designing a candidate polyepitope polypeptide comprising a tandem 
array of epitopes for expression in a suitable host, said method comprising, 

identifying the relative hydrophobicity of each of said epitopes, 

dividing said epitopes on the basis of said identified hydrophobicities into groups 
of substantially equivalent numbers, said groups comprising at least a first group of 
epitopes of most relative hydrophobicity and a second group of epitopes of least relative 
hydrophobicity, and 

arranging epitopes from said first and second groups in a substantially alternating 
manner so as to generate said candidate polyepitope polypeptide with reduced amplitude 
in hydrophobicity and/or length of any hydrophobic region(s). 

8. A method for designing a candidate polyepitope polypeptide comprising a tandem 
array of epitopes for expression in a suitable host, said method comprising, 

identifying the relative hydrophobicity of each of said epitopes, 

dividing said epitopes on the basis of said identified hydrophobicities into three 
groups of substantially equivalent numbers, said groups comprising a first group of 
epitopes of most hydrophobicity, a second group of epitopes of intermediate relative 
hydrophobicity, and a third group of epitopes of least relative hydrophobicity, 

arranging epitopes from said first, second and third groups into triplets containing 
an epitope from each group, and 

arranging said triplets in a linked sequence so as to generate said candidate 
polyepitope polypeptide with reduced amplitude in hydrophobicity and/or length of any 
hydrophobic region(s). 

9. A method for designing a candidate polyepitope polypeptide comprising a tandem 
array of epitopes for expression in a suitable host, said method comprising, 

identifying the relative hydrophobicity of each of said epitopes, 

dividing said epitopes on the basis of said identified hydrophobicities into four 
groups of substantially equivalent numbers, said groups comprising a first group of 
epitopes of most hydrophobicity, a second group of epitopes of lesser relative 
hydrophobicity, a third group of epitopes of even lesser relative hydrophobicity, and a 
fourth group of least relative hydrophobicity, 

arranging epitopes from said first, second and third groups into quadruplets 
containing an epitope from each group, and 

arranging said quadruplets in a linked sequence so as to generate said candidate 
polyepitope polypeptide with reduced amplitude in hydrophobicity and/or length of any 
hydrophobic region(s). 

10. The method of any one of claims 7 to 9, wherein the polyepitope polypeptide 
comprises 5 to 100 epitopes. 

11. The method of any one of claims 7 to 9, wherein the polyepitope polypeptide 
comprises 10 to 35 epitopes. 

12. The method of any one of claims 7 to 11, wherein the epitopes comprising the 
polyepitope polypeptide are selected from epitopes of any one of the viruses of the group 
consisting of Epstein-Barr virus (EBV), hepatitis C virus (HCV), human immunodeficiency 
virus (HIV) and cytomegalovirus (CMV). 

13. A method of expressing a polypeptide in a suitable host said method comprising, 

designing a polypeptide in accordance with the method of any one of claims 1 to 12, 

introducing a polynucleotide encoding said polypeptide into said host, such that 
said host is capable of expressing said polypeptide, and 

culturing said bacterial host under conditions suitable for expression of said 
polypeptide. 

14. A polypeptide designed in accordance with the method of any one of claims 1 to 5. 

15. A polyepitope polypeptide designed in accordance with the method of any one of 
claims 1 to 12. 

16. A polyepitope polypeptide comprising N epitopes, wherein N is any integer, said 
polyepitope polypeptide having the formula; 

Triplet 1 - Triplet 2 - .... - Triplet N/3, 

wherein each of said triplets comprises three linked epitopes selected by, 

identifying and ranking the relative hydrophobicity of each of the N epitopes, 

grouping the ranked N epitopes into three groups of substantially equivalent 
numbers, based upon the identified relative hydrophobicity of the N epitopes, to produce a 
first group comprising the epitopes of most relative hydrophobicity, a second group of 
epitopes of intermediate relative hydrophobicity, and a third group of epitopes of least 
relative hydrophobicity, and 

selecting the epitopes for each of said triplets according to the following table: 

                            Epitope 1                          Epitope 2                          Epitope 3
Triplet 1 (N-terminal)      Most hydrophilic of Group 2        Most hydrophobic of Group 1        Most hydrophilic of Group 3
Triplet 2                   2nd most hydrophilic of Group 2    2nd most hydrophobic of Group 1    2nd most hydrophilic of Group 3
....                        ....                               ....                               ....
Triplet N/3 (C-terminal)    Most hydrophobic of Group 2        Most hydrophilic of Group 1        Most hydrophobic of Group 3
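
By way of illustration only, the triplet selection set out in the table above can be sketched in Python. The sketch assumes that N is an exact multiple of 3 and that a larger value means a more hydrophobic epitope; the function name and the epitope labels E1 to E6 with their values are hypothetical. It covers only the table-based selection and not any later manual exchange of individual epitopes.

```python
from typing import Dict, List, Tuple


def arrange_triplets(hydrophobicity: Dict[str, float]) -> List[Tuple[str, str, str]]:
    """Order epitopes into triplets following the selection table.

    Assumes a higher value means a more hydrophobic epitope and that the
    number of epitopes is a non-zero multiple of 3.
    """
    # Rank from most hydrophobic to most hydrophilic.
    ranked = sorted(hydrophobicity, key=hydrophobicity.get, reverse=True)
    if not ranked or len(ranked) % 3 != 0:
        raise ValueError("this sketch assumes N is a non-zero multiple of 3")
    k = len(ranked) // 3
    group1 = ranked[:k]        # most hydrophobic third
    group2 = ranked[k:2 * k]   # intermediate third
    group3 = ranked[2 * k:]    # least hydrophobic third
    triplets = []
    for i in range(k):
        triplets.append((
            group2[k - 1 - i],  # (i+1)th most hydrophilic of Group 2
            group1[i],          # (i+1)th most hydrophobic of Group 1
            group3[k - 1 - i],  # (i+1)th most hydrophilic of Group 3
        ))
    return triplets


# Hypothetical epitope labels and hydrophobicity values.
example = {"E1": 6.0, "E2": 5.0, "E3": 4.0, "E4": 3.0, "E5": 2.0, "E6": 1.0}
triplets = arrange_triplets(example)
order = "-".join(name for triplet in triplets for name in triplet)
print(order)
```

With these sample values the most hydrophobic epitope (E1) is flanked by the most hydrophilic members of the other two groups, so no two epitopes from Group 1 are adjacent in the linked sequence.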



17. The polyepitope polypeptide of claim 15 or 16, wherein the epitopes are contiguous 
or spaced apart by intervening sequences which are substantially free of sequences which 
naturally flank said epitopes. 

18. A polypeptide vaccine comprising a polyepitope polypeptide according to any one 
of claims 14 to 17 and a pharmaceutically acceptable carrier and/or adjuvant. 

19. A polyepitope polypeptide comprising an amino acid sequence substantially 
corresponding to an amino acid sequence selected from the group consisting of: 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - VQPPQLTLQV - 
GLCTLVAML - RLRAEAQVK - IEDPPFNSL - YLLEMLWRL - GQGGSPTAM - 
AVLLHEESM - IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - 
QAKWRLQTL - RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - 
CLGGLLTMV - EENLLDFVRF - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW 
(SEQ ID NO:1), 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - GLCTLVAML - 
RLRAEAQVK - IEDPPFNSL - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW - 
IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - QAKWRLQTL - 
RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - CLGGLLTMV - 
EENLLDFVRF - YLLEMLWRL - GQGGSPTAM - AVLLHEESM - VQPPQLTLQV 
(SEQ ID NO:2), 

SSCSSCPLSKI - HRCQAIRKK - CLGGLLTMV - LTAGFLIFL - RLRAEAQVK - 
IEDPPFNSL - LLSAWILTA - RRRWRRLTV - PYLFWLAAI - YLLEMLWRL - 
GQGGSPTAM - VMSNTLLSAW - ALLVLYSFA - RAKFKQLL - IALYLQQNW - 
TYGPVFMCL - QAKWRLQTL - YLQQNWWTL - YPLHEQHGM - CPLSKILL 
(SEQ ID NO:3), 

IPIVAIVALV - RLRPGGKKK - ILKEPVHGV - PLVKLWYQL - RPGGKKKYKL - 
KYKLKHIVW - TWETWWTEYW - EIKDTKEAL - KRWIILGLNK - 
KLWVTVYYGV - KIEELRQHL - MTNNPPIPV - VTLWQRPLV - WASRELERF - 
LLWKGEGAV - YTAFTIPSI - IYQEPFKNLK - SLYNTVATL - AIIRILQQL - 
AIFQSSMTK - VIYQYMDDL - LVGPTPVNI - TPQDLNTML - YLAWVPAHK - 
ALVEICTEM - TLNAWVKVV (SEQ ID NO:4), 

and 

LLFNILGGWV - KTSERSQPR - FLLLADARV - LLFLLLADA - RLGVRATRK - 
GVAGALVAFK - LPGCSFSIF - RMYVGGVEHR - VAGALVAFK - DLMGYIPLV - 
LIFCHSKKK - ILAGYGAGV - HMWNFISGI - QLFTFSPRR - VGIYLLPNR - 
FWAKHMWNF - YLVTRHADV - LSAFSLHSY - WMNRLIAFA - YLLPRRGPRL - 
YLVAYQATV - RLIVFPDLGV - TLGFGAYMSK - IPFYGKAI - VLVGGVLAA - 
CTCGSSDLY (SEQ ID NO:5). 

20. The polyepitope polypeptide of claim 19, wherein the polyepitope polypeptide 
comprises an amino acid sequence substantially corresponding to: 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - VQPPQLTLQV - 
GLCTLVAML - RLRAEAQVK - IEDPPFNSL - YLLEMLWRL - GQGGSPTAM - 
AVLLHEESM - IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - 
QAKWRLQTL - RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - 
CLGGLLTMV - EENLLDFVRF - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW 
(SEQ ID NO:1). 

21. The polyepitope polypeptide of claim 19, wherein the polyepitope polypeptide 
comprises an amino acid sequence substantially corresponding to: 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - GLCTLVAML - 
RLRAEAQVK - IEDPPFNSL - TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW - 
IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - QAKWRLQTL - 
RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - CLGGLLTMV - 
EENLLDFVRF - YLLEMLWRL - GQGGSPTAM - AVLLHEESM - VQPPQLTLQV 
(SEQ ID NO:2). 

22. The polyepitope polypeptide of claim 19, wherein the polyepitope polypeptide 
comprises an amino acid sequence substantially corresponding to: 

SSCSSCPLSKI - HRCQAIRKK - CLGGLLTMV - LTAGFLIFL - RLRAEAQVK - 
IEDPPFNSL - LLSAWILTA - RRRWRRLTV - PYLFWLAAI - YLLEMLWRL - 
GQGGSPTAM - VMSNTLLSAW - ALLVLYSFA - RAKFKQLL - IALYLQQNW - 
TYGPVFMCL - QAKWRLQTL - YLQQNWWTL - YPLHEQHGM - CPLSKILL 
(SEQ ID NO:3). 

23. The polyepitope polypeptide of claim 19, wherein the polyepitope polypeptide 
comprises an amino acid sequence substantially corresponding to: 

IPIVAIVALV - RLRPGGKKK - ILKEPVHGV - PLVKLWYQL - RPGGKKKYKL - 
KYKLKHIVW - TWETWWTEYW - EIKDTKEAL - KRWIILGLNK - 
KLWVTVYYGV - KIEELRQHL - MTNNPPIPV - VTLWQRPLV - WASRELERF - 
LLWKGEGAV - YTAFTIPSI - IYQEPFKNLK - SLYNTVATL - AIIRILQQL - 
AIFQSSMTK - VIYQYMDDL - LVGPTPVNI - TPQDLNTML - YLAWVPAHK - 
ALVEICTEM - TLNAWVKVV (SEQ ID NO:4). 

24. The polyepitope polypeptide of claim 19, wherein the polyepitope polypeptide 
comprises an amino acid sequence substantially corresponding to: 

LLFNILGGWV - KTSERSQPR - FLLLADARV - LLFLLLADA - RLGVRATRK - 
GVAGALVAFK - LPGCSFSIF - RMYVGGVEHR - VAGALVAFK - DLMGYIPLV - 
LIFCHSKKK - ILAGYGAGV - HMWNFISGI - QLFTFSPRR - VGIYLLPNR - 
FWAKHMWNF - YLVTRHADV - LSAFSLHSY - WMNRLIAFA - YLLPRRGPRL - 
YLVAYQATV - RLIVFPDLGV - TLGFGAYMSK - IPFYGKAI - VLVGGVLAA - 
CTCGSSDLY (SEQ ID NO:5). 

25. A polypeptide vaccine comprising a polyepitope polypeptide according to any one 
of claims 19 to 24 and a pharmaceutically acceptable carrier and/or adjuvant. 

26. A viral or DNA vaccine comprising a polynucleotide encoding a polypeptide 
designed in accordance with the method of any one of claims 1 to 12 and a 
pharmaceutically acceptable carrier and/or adjuvant. 

27. A viral or DNA vaccine comprising a polynucleotide encoding a polypeptide 
according to any one of claims 19 to 24 and a pharmaceutically acceptable carrier and/or 
adjuvant. 

AMENDED CLAIMS 

[received by the International Bureau on 12 November 2003 (12.11.03); 
original claim 13 amended; remaining claims unchanged (1 page)] 

12. The method of any one of claims 7 to 11, wherein the epitopes comprising the 
polyepitope polypeptide are selected from epitopes of any one of the viruses of the group 
consisting of Epstein-Barr virus (EBV), hepatitis C virus (HCV), human immunodeficiency 
virus (HIV) and cytomegalovirus (CMV). 

13. A method of expressing a polypeptide in a suitable host, said method comprising, 

designing a polypeptide in accordance with the method of any one of claims 1 to 12, 

introducing a polynucleotide encoding said polypeptide into said host, such that 
said host is capable of expressing said polypeptide, and 

culturing said host under conditions suitable for expression of said polypeptide. 

14. A polypeptide designed in accordance with the method of any one of claims 1 to 5. 

15. A polyepitope polypeptide designed in accordance with the method of any one of 
claims 1 to 12. 

16. A polyepitope polypeptide comprising N epitopes, wherein N is any integer, said 
polyepitope polypeptide having the formula; 

Triplet 1 - Triplet 2 - .... - Triplet N/3, 

wherein each of said triplets comprises three linked epitopes selected by, 

identifying and ranking the relative hydrophobicity of each of the N epitopes, 

grouping the ranked N epitopes into three groups of substantially equivalent 
numbers, based upon the identified relative hydrophobicity of the N epitopes, to produce a 
first group comprising the epitopes of most relative hydrophobicity, a second group of 
epitopes of intermediate relative hydrophobicity, and a third group of epitopes of least 
relative hydrophobicity, and 

selecting the epitopes for each of said triplets according to the following table: 

Sequence Listing: 



<110> CSL Ltd and The Council of the Queensland Institute of 

Medical Research 

<120> Expression of hydrophobic proteins 

<130> 13455734 

<150> AU2002950183 

<151> 2002-07-12 

<160> 102 

<170> PatentIn version 3.1 

<210> 1 

<211> 233 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 1 

Phe Leu Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe Trp Leu Ala 
1 5 10 15 

Ala Ile His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg Ile Tyr Asp 
20 25 30 

Leu Ile Glu Leu Val Gln Pro Pro Gln Leu Thr Leu Gln Val Gly Leu 
35 40 45 

Cys Thr Leu Val Ala Met Leu Arg Leu Arg Ala Glu Ala Gln Val Lys 
50 55 60 

Ile Glu Asp Pro Pro Phe Asn Ser Leu Tyr Leu Leu Glu Met Leu Trp 
65 70 75 80 

Arg Leu Gly Gln Gly Gly Ser Pro Thr Ala Met Ala Val Leu Leu His 
85 90 95 

Glu Glu Ser Met Ile Ala Leu Tyr Leu Gln Gln Asn Trp Trp Thr Leu 
100 105 110 

Arg Ala Lys Phe Lys Gln Leu Leu Ser Ser Cys Ser Ser Cys Pro Leu 
115 120 125 

Ser Lys Ile Thr Tyr Gly Pro Val Phe Met Cys Leu Gln Ala Lys Trp 
130 135 140 

Arg Leu Gln Thr Leu Arg Pro Pro Ile Phe Ile Arg Arg Leu Val Ser 
145 150 155 160 

Phe Ile Glu Phe Val Gly Trp Tyr Pro Leu His Glu Gln His Gly Met 
165 170 175 

Val Glu Ile Thr Pro Tyr Lys Pro Thr Trp Cys Leu Gly Gly Leu Leu 
180 185 190 

Thr Met Val Glu Glu Asn Leu Leu Asp Phe Val Arg Phe Thr Tyr Ser 
195 200 205 

Ala Gly Ile Val Gln Ile Leu Leu Asp Phe Val Arg Phe Met Gly Val 
210 215 220 

Glu Gly Gly Val Gly Trp Arg His Trp 
225 230 

<210> 2 

<211> 233 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 2 

Phe Leu Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe Trp Leu Ala 
1 5 10 15 

Ala Ile His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg Ile Tyr Asp 
20 25 30 

Leu Ile Glu Leu Gly Leu Cys Thr Leu Val Ala Met Leu Arg Leu Arg 
35 40 45 

Ala Glu Ala Gln Val Lys Ile Glu Asp Pro Pro Phe Asn Ser Leu Thr 
50 55 60 

Tyr Ser Ala Gly Ile Val Gln Ile Leu Leu Asp Phe Val Arg Phe Met 
65 70 75 80 

Gly Val Glu Gly Gly Val Gly Trp Arg His Trp Ile Ala Leu Tyr Leu 
85 90 95 

Gln Gln Asn Trp Trp Thr Leu Arg Ala Lys Phe Lys Gln Leu Leu Ser 
100 105 110 

Ser Cys Ser Ser Cys Pro Leu Ser Lys Ile Thr Tyr Gly Pro Val Phe 
115 120 125 

Met Cys Leu Gln Ala Lys Trp Arg Leu Gln Thr Leu Arg Pro Pro Ile 
130 135 140 

Phe Ile Arg Arg Leu Val Ser Phe Ile Glu Phe Val Gly Trp Tyr Pro 
145 150 155 160 

Leu His Glu Gln His Gly Met Val Glu Ile Thr Pro Tyr Lys Pro Thr 
165 170 175 

Trp Cys Leu Gly Gly Leu Leu Thr Met Val Glu Glu Asn Leu Leu Asp 
180 185 190 

Phe Val Arg Phe Tyr Leu Leu Glu Met Leu Trp Arg Leu Gly Gln Gly 
195 200 205 

Gly Ser Pro Thr Ala Met Ala Val Leu Leu His Glu Glu Ser Met Val 
210 215 220 

Gln Pro Pro Gln Leu Thr Leu Gln Val 
225 230 

<210> 3 
<211> 181 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 3 

Ser Ser Cys Ser Ser Cys Pro Leu Ser Lys Ile His Arg Cys Gln Ala 
1 5 10 15 

Ile Arg Lys Lys Cys Leu Gly Gly Leu Leu Thr Met Val Leu Thr Ala 
20 25 30 

Gly Phe Leu Ile Phe Leu Arg Leu Arg Ala Glu Ala Gln Val Lys Ile 
35 40 45 

Glu Asp Pro Pro Phe Asn Ser Leu Leu Leu Ser Ala Trp Ile Leu Thr 
50 55 60 

Ala Arg Arg Arg Trp Arg Arg Leu Thr Val Pro Tyr Leu Phe Trp Leu 
65 70 75 80 

Ala Ala Ile Tyr Leu Leu Glu Met Leu Trp Arg Leu Gly Gln Gly Gly 
85 90 95 

Ser Pro Thr Ala Met Val Met Ser Asn Thr Leu Leu Ser Ala Trp Ala 
100 105 110 

Leu Leu Val Leu Tyr Ser Phe Ala Arg Ala Lys Phe Lys Gln Leu Leu 
115 120 125 

Ile Ala Leu Tyr Leu Gln Gln Asn Trp Thr Tyr Gly Pro Val Phe Met 
130 135 140 

Cys Leu Gln Ala Lys Trp Arg Leu Gln Thr Leu Tyr Leu Gln Gln Asn 
145 150 155 160 

Trp Trp Thr Leu Tyr Pro Leu His Glu Gln His Gly Met Cys Pro Leu 
165 170 175 

Ser Lys Ile Leu Leu 
180 

<210> 4 
<211> 240 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 4 

Ile Pro Ile Val Ala Ile Val Ala Leu Val Arg Leu Arg Pro Gly Gly 
1 5 10 15 

Lys Lys Lys Ile Leu Lys Glu Pro Val His Gly Val Pro Leu Val Lys 
20 25 30 

Leu Trp Tyr Gln Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
35 40 45 

Tyr Lys Leu Lys His Ile Val Trp Thr Trp Glu Thr Trp Trp Thr Glu 
50 55 60 

Tyr Trp Glu Ile Lys Asp Thr Lys Glu Ala Leu Lys Arg Trp Ile Ile 
65 70 75 80 

Leu Gly Leu Asn Lys Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Lys 
85 90 95 

Ile Glu Glu Leu Arg Gln His Leu Met Thr Asn Asn Pro Pro Ile Pro 
100 105 110 

Val Val Thr Leu Trp Gln Arg Pro Leu Val Trp Ala Ser Arg Glu Leu 
115 120 125 

Glu Arg Phe Leu Leu Trp Lys Gly Glu Gly Ala Val Tyr Thr Ala Phe 
130 135 140 

Thr Ile Pro Ser Ile Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys Ser 
145 150 155 160 

Leu Tyr Asn Thr Val Ala Thr Leu Ala Ile Ile Arg Ile Leu Gln Gln 
165 170 175 

Leu Ala Ile Phe Gln Ser Ser Met Thr Lys Val Ile Tyr Gln Tyr Met 
180 185 190 

Asp Asp Leu Leu Val Gly Pro Thr Pro Val Asn Ile Thr Pro Gln Asp 
195 200 205 

Leu Asn Thr Met Leu Tyr Leu Ala Trp Val Pro Ala His Lys Ala Leu 
210 215 220 

Val Glu Ile Cys Thr Glu Met Thr Leu Asn Ala Trp Val Lys Val Val 
225 230 235 240 

<210> 5 
<211> 239 
<212> PRT 

<213> Hepatitis C virus 
<400> 5 

Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Lys Thr Ser Glu Arg Ser 
1 5 10 15 

Gln Pro Arg Phe Leu Leu Leu Ala Asp Ala Arg Val Leu Leu Phe Leu 
20 25 30 

Leu Leu Ala Asp Ala Arg Leu Gly Val Arg Ala Thr Arg Lys Gly Val 
35 40 45 

Ala Gly Ala Leu Val Ala Phe Lys Leu Pro Gly Cys Ser Phe Ser Ile 
50 55 60 

Phe Arg Met Tyr Val Gly Gly Val Glu His Arg Val Ala Gly Ala Leu 
65 70 75 80 

Val Ala Phe Lys Asp Leu Met Gly Tyr Ile Pro Leu Val Leu Ile Phe 
85 90 95 

Cys His Ser Lys Lys Lys Ile Leu Ala Gly Tyr Gly Ala Gly Val His 
100 105 110 

Met Trp Asn Phe Ile Ser Gly Ile Gln Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg Val Gly Ile Tyr Leu Leu Pro Asn Arg Phe Trp Ala Lys His Met 
130 135 140 

Trp Asn Phe Tyr Leu Val Thr Arg His Ala Asp Val Leu Ser Ala Phe 
145 150 155 160 

Ser Leu His Ser Tyr Trp Met Asn Arg Leu Ile Ala Phe Ala Tyr Leu 
165 170 175 

Leu Pro Arg Arg Gly Pro Arg Leu Tyr Leu Val Ala Tyr Gln Ala Thr 
180 185 190 

Val Arg Leu Ile Val Phe Pro Asp Leu Gly Val Thr Leu Gly Phe Gly 
195 200 205 

Ala Tyr Met Ser Lys Ile Pro Phe Tyr Gly Lys Ala Ile Val Leu Val 
210 215 220 

Gly Gly Val Leu Ala Ala Cys Thr Cys Gly Ser Ser Asp Leu Tyr 
225 230 235 

<210> 6 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 6 

Cys Leu Gly Gly Leu Leu Thr Met Val 
1 5 

<210> 7 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 7 

Gly Leu Cys Thr Leu Val Ala Met Leu 
1 5 

<210> 8 

<211> 10 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 8 

Leu Leu Asp Phe Val Arg Phe Met Gly Val 
1 5 10 

<210> 9 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 9 

Tyr Leu Leu Glu Met Leu Trp Arg Leu 

1 5 

<210> 10 
<211> 9 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 10 

Tyr Leu Gln Gln Asn Trp Trp Thr Leu 
1 5 

<210> 11 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 11 

Arg Leu Arg Ala Glu Ala Gln Val Lys 
1 5 

<210> 12 
<211> 11 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 12 

Ser Ser Cys Ser Ser Cys Pro Leu Ser Lys Ile 
1 5 10 

<210> 13 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 13 

Pro Tyr Leu Phe Trp Leu Ala Ala Ile 
1 5 

<210> 14 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 14 

Thr Tyr Gly Pro Val Phe Met Cys Leu 
1 5 

<210> 15 
<211> 9 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 15 

Thr Tyr Ser Ala Gly Ile Val Gln Ile 
1 5 

<210> 16 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 16 

Arg Pro Pro Ile Phe Ile Arg Arg Leu 
1 5 



<210> 17 
<211> 9 
<212> PRT 
<213> epstein-barr virus (EBV) 
<400> 17 

Phe Leu Arg Gly Arg Ala Tyr Gly Leu 
1 5 

<210> 18 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 18 

Gln Ala Lys Trp Arg Leu Gln Thr Leu 
1 5 

<210> 19 

<211> 8 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 19 

Arg Ala Lys Phe Lys Gln Leu Leu 
1 5 

<210> 20 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 20 

His Arg Cys Gln Ala Ile Arg Lys Lys 
1 5 

<210> 21 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 21 

Arg Arg Ile Tyr Asp Leu Ile Glu Leu 
1 5 

<210> 22 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 22 

Ala Val Leu Leu His Glu Glu Ser Met 
1 5 

<210> 23 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 23 

Tyr Pro Leu His Glu Gln His Gly Met 
1 5 

<210> 24 
<211> 10 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 24 

Glu Glu Asn Leu Leu Asp Phe Val Arg Phe 
1 5 10 

<210> 25 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 25 

Glu Gly Gly Val Gly Trp Arg His Trp 
1 5 

<210> 26 
<211> 10 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 26 

Val Glu Ile Thr Pro Tyr Lys Pro Thr Trp 
1 5 10 

<210> 27 
<211> 10 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 27 

Val Gln Pro Pro Gln Leu Thr Leu Gln Val 
1 5 10 

<210> 28 
<211> 9 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 28 

Ile Ala Leu Tyr Leu Gln Gln Asn Trp 
1 5 

<210> 29 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 29 

Val Ser Phe Ile Glu Phe Val Gly Trp 
1 5 

<210> 30 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 30 

Ile Glu Asp Pro Pro Phe Asn Ser Leu 
1 5 

<210> 31 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 31 

Gly Gln Gly Gly Ser Pro Thr Ala Met
1 5 

<210> 32 
<211> 30 
<212> DNA 

<213> epstein-barr virus (EBV) 
<400> 32 

agggatccca tggaagatcc tggttttttc 30 



<210> 33 
<211> 31 
<212> DNA 

<213> epstein-barr virus (EBV) 
<400> 33 

tctagaggtc gacacctgtc gttgtattgg g 31 



<210> 34 

<211> 12 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 34 

Ile Ala Leu Tyr Leu Gln Gln Asn Trp Trp Thr Leu
1 5 10 

<210> 35 

<211> 6 

<212> PRT 

<213> Hexa-histidine tag

<400> 35 

His His His His His His 
1 5 

<210> 36 
<211> 10 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 36 

Val Met Ser Asn Thr Leu Leu Ser Ala Trp 
1 5 10

<210> 37 
<211> 9 
<212> PRT 
<213> epstein-barr virus (EBV)
<400> 37 

Leu Leu Ser Ala Trp Ile Leu Thr Ala
1 5 

<210> 38 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 38 

Leu Thr Ala Gly Phe Leu Ile Phe Leu
1 5 

<210> 39 

<211> 8 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 39 

Cys Pro Leu Ser Lys Ile Leu Leu
1 5 



<210> 40 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 40 

Arg Arg Arg Trp Arg Arg Leu Thr Val 
1 5 



<210> 41 

<211> 9 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 41 

Ala Leu Leu Val Leu Tyr Ser Phe Ala 
1 5 

<210> 42

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 42 

Arg Leu Arg Pro Gly Gly Lys Lys Lys 
1 5 

<210> 43 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 43 



Lys Tyr Lys Leu Lys His Ile Val Trp
1 5 



WO 2004/007556 



PCT/AU2003/000910 




<210> 44
<211> 9 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 44 

Trp Ala Ser Arg Glu Leu Glu Arg Phe 
1 5 

<210> 45 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 45 

Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu 
1 5 10

<210> 46 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 46 

Ser Leu Tyr Asn Thr Val Ala Thr Leu 
1 5 

<210> 47 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 47 

Glu Ile Lys Asp Thr Lys Glu Ala Leu
1 5 

<210> 48 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 48 

Thr Leu Asn Ala Trp Val Lys Val Val 
1 5 

<210> 49 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 49 

Thr Pro Gln Asp Leu Asn Thr Met Leu
1 5 

<210> 50 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 50 
Met Thr Asn Asn Pro Pro Ile Pro Val
1 5 

<210> 51 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 51 

Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys
1 5 10

<210> 52 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 52 

Val Thr Leu Trp Gln Arg Pro Leu Val

1 5 

<210> 53 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 53 

Leu Val Gly Pro Thr Pro Val Asn Ile
1 5 

<210> 54 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 54 

Ala Leu Val Glu Ile Cys Thr Glu Met
1 5 

<210> 55 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 55 

Tyr Thr Ala Phe Thr Ile Pro Ser Ile
1 5 

<210> 56 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 56 

Ala Ile Phe Gln Ser Ser Met Thr Lys
1 5 

<210> 57 
<211> 9 
<212> PRT 
<213> Human immunodeficiency virus
<400> 57 

Val Ile Tyr Gln Tyr Met Asp Asp Leu
1 5 

<210> 58 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 58 

Lys Ile Glu Glu Leu Arg Gln His Leu
1 5 

<210> 59 
<211> 9 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 59 

Ile Leu Lys Glu Pro Val His Gly Val
1 5

<210> 60 

<211> 10 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 60 

Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys
1 5 10 

<210> 61 

<211> 10 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 61 

Thr Trp Glu Thr Trp Trp Thr Glu Tyr Trp 
1 5 10 

<210> 62 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 62 

Pro Leu Val Lys Leu Trp Tyr Gln Leu
1 5 

<210> 63 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 63 



Tyr Leu Ala Trp Val Pro Ala His Lys 
1 5 
<210> 64
<211> 9 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 64 

Leu Leu Trp Lys Gly Glu Gly Ala Val 
1 5 

<210> 65 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 65 

Ala Ile Ile Arg Ile Leu Gln Gln Leu

1 5 

<210> 66 

<211> 10 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 66 

Ile Pro Ile Val Ala Ile Val Ala Leu Val
1 5 10 

<210> 67 

<211> 10 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 67 

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val 
1 5 10 

<210> 68 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 68 

Cys Thr Cys Gly Ser Ser Asp Leu Tyr 
1 5 

<210> 69 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 69 

Phe Leu Leu Leu Ala Asp Ala Arg Val 
1 5 

<210> 70 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 70 
Tyr Leu Val Ala Tyr Gln Ala Thr Val
1 5 

<210> 71

<211> 10 

<212> PRT 

<213> Hepatitis C virus 

<400> 71 

Arg Leu Ile Val Phe Pro Asp Leu Gly Val
1 5 10 

<210> 72 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 72 

Asp Leu Met Gly Tyr Ile Pro Leu Val
1 5 

<210> 73 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 73 

Trp Met Asn Arg Leu Ile Ala Phe Ala
1 5 

<210> 74 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 74 

Val Leu Val Gly Gly Val Leu Ala Ala
1 5 

<210> 75 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 75 

His Met Trp Asn Phe Ile Ser Gly Ile
1 5 

<210> 76 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 76 

Ile Leu Ala Gly Tyr Gly Ala Gly Val
1 5 

<210> 77 
<211> 10 
<212> PRT 
<213> Hepatitis C virus

<400> 77 

Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu 
1 5 10 

<210> 78 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 78 

Leu Leu Phe Leu Leu Leu Ala Asp Ala 
1 5 

<210> 79 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 79 

Tyr Leu Val Thr Arg His Ala Asp Val 
1 5 

<210> 80 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 80 

Lys Thr Ser Glu Arg Ser Gln Pro Arg
1 5

<210> 81 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 81 

Arg Leu Gly Val Arg Ala Thr Arg Lys 
1 5 

<210> 82 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 82 

Gln Leu Phe Thr Phe Ser Pro Arg Arg
1 5 

<210> 83 

<211> 10 

<212> PRT 

<213> Hepatitis C virus 

<400> 83 



Arg Met Tyr Val Gly Gly Val Glu His Arg 
1 5 10 
<210> 84

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 84 

Leu Ile Phe Cys His Ser Lys Lys Lys
1 5 

<210> 85

<211> 10 

<212> PRT 

<213> Hepatitis C virus 

<400> 85 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys 
1 5 10

<210> 86 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 86 

Val Ala Gly Ala Leu Val Ala Phe Lys 
1 5 

<210> 87 

<211> 10 

<212> PRT 

<213> Hepatitis C virus 

<400> 87 

Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys 
1 5 10 

<210> 88 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 88 

Leu Pro Gly Cys Ser Phe Ser Ile Phe
1 5 

<210> 89 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 89 

Leu Ser Ala Phe Ser Leu His Ser Tyr 
1 5 

<210> 90 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 90 
Phe Trp Ala Lys His Met Trp Asn Phe
1 5 

<210> 91 

<211> 9 

<212> PRT 

<213> Hepatitis C virus 

<400> 91 

Val Gly Ile Tyr Leu Leu Pro Asn Arg
1 5 

<210> 92 

<211> 10 

<212> PRT 

<213> Hepatitis C virus 

<400> 92 

Leu Leu Phe Asn Ile Leu Gly Gly Trp Val
1 5 10

<210> 93 

<211> 8 

<212> PRT 

<213> Hepatitis C virus 

<400> 93 

Ile Pro Phe Tyr Gly Lys Ala Ile
1 5 

<210> 94 

<211> 242 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 94 

Met Val Asp Phe Leu Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe
1 5 10 15

Trp Leu Ala Ala Ile His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg
20 25 30

Ile Tyr Asp Leu Ile Glu Leu Val Gln Pro Pro Gln Leu Thr Leu Gln
35 40 45

Val Gly Leu Cys Thr Leu Val Ala Met Leu Arg Leu Arg Ala Glu Ala
50 55 60

Gln Val Lys Ile Glu Asp Pro Pro Phe Asn Ser Leu Tyr Leu Leu Glu
65 70 75 80

Met Leu Trp Arg Leu Gly Gln Gly Gly Ser Pro Thr Ala Met Ala Val
85 90 95

Leu Leu His Glu Glu Ser Met Ile Ala Leu Tyr Leu Gln Gln Asn Trp
100 105 110

Trp Thr Leu Arg Ala Lys Phe Lys Gln Leu Leu Ser Ser Cys Ser Ser
115 120 125

Cys Pro Leu Ser Lys Ile Thr Tyr Gly Pro Val Phe Met Cys Leu Gln
130 135 140

Ala Lys Trp Arg Leu Gln Thr Leu Arg Pro Pro Ile Phe Ile Arg Arg
145 150 155 160

Leu Val Ser Phe Ile Glu Phe Val Gly Trp Tyr Pro Leu His Glu Gln
165 170 175

His Gly Met Val Glu Ile Thr Pro Tyr Lys Pro Thr Trp Cys Leu Gly
180 185 190

Gly Leu Leu Thr Met Val Glu Glu Asn Leu Leu Asp Phe Val Arg Phe
195 200 205

Thr Tyr Ser Ala Gly Ile Val Gln Ile Leu Leu Asp Phe Val Arg Phe
210 215 220

Met Gly Val Glu Gly Gly Val Gly Trp Arg His Trp His His His His
225 230 235 240

His His 



<210> 95 

<211> 242 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 95 

Met Val Asp Phe Leu Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe 
1 5 10 15 

Trp Leu Ala Ala Ile His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg
20 25 30

Ile Tyr Asp Leu Ile Glu Leu Gly Leu Cys Thr Leu Val Ala Met Leu
35 40 45

Arg Leu Arg Ala Glu Ala Gln Val Lys Ile Glu Asp Pro Pro Phe Asn
50 55 60

Ser Leu Thr Tyr Ser Ala Gly Ile Val Gln Ile Leu Leu Asp Phe Val
65 70 75 80

Arg Phe Met Gly Val Glu Gly Gly Val Gly Trp Arg His Trp Ile Ala
85 90 95

Leu Tyr Leu Gln Gln Asn Trp Trp Thr Leu Arg Ala Lys Phe Lys Gln
100 105 110

Leu Leu Ser Ser Cys Ser Ser Cys Pro Leu Ser Lys Ile Thr Tyr Gly
115 120 125

Pro Val Phe Met Cys Leu Gln Ala Lys Trp Arg Leu Gln Thr Leu Arg
130 135 140

Pro Pro Ile Phe Ile Arg Arg Leu Val Ser Phe Ile Glu Phe Val Gly
145 150 155 160

Trp Tyr Pro Leu His Glu Gln His Gly Met Val Glu Ile Thr Pro Tyr
165 170 175

Lys Pro Thr Trp Cys Leu Gly Gly Leu Leu Thr Met Val Glu Glu Asn
180 185 190

Leu Leu Asp Phe Val Arg Phe Tyr Leu Leu Glu Met Leu Trp Arg Leu
195 200 205

Gly Gln Gly Gly Ser Pro Thr Ala Met Ala Val Leu Leu His Glu Glu
210 215 220

Ser Met Val Gln Pro Pro Gln Leu Thr Leu Gln Val His His His His
225 230 235 240 

His His 



<210> 96 

<211> 669 

<212> PRT 

<213> epstein-barr virus (EBV) 

<400> 96 

Met Glu Asp Pro Gly Phe Phe Asn Val Glu Ile Pro Glu Phe Pro Phe
1 5 10 15

Tyr Pro Thr Cys Asn Val Cys Thr Ala Asp Val Asn Val Thr Ile Asn
20 25 30

Phe Asp Val Gly Gly Lys Lys His Gln Leu Asp Leu Asp Phe Gly Gln
35 40 45

Leu Thr Pro His Thr Lys Ala Val Tyr Gln Pro Arg Gly Ala Phe Gly
50 55 60

Gly Ser Glu Asn Ala Thr Asn Leu Phe Leu Leu Glu Leu Leu Gly Ala
65 70 75 80

Gly Glu Leu Ala Leu Thr Met Arg Ser Lys Lys Leu Pro Ile Asn Val
85 90 95

Thr Thr Gly Glu Glu Gln Gln Val Ser Leu Glu Ser Val Asp Val Tyr
100 105 110

Phe Gln Asp Val Phe Gly Thr Met Trp Cys His His Ala Glu Met Gln
115 120 125

Asn Pro Val Tyr Leu Ile Pro Glu Thr Val Pro Tyr Ile Lys Trp Asp
130 135 140

Asn Cys Asn Ser Thr Asn Ile Thr Ala Val Val Arg Ala Gln Gly Leu
145 150 155 160

Asp Val Thr Leu Pro Leu Ser Leu Pro Thr Ser Ala Gln Asp Ser Asn
165 170 175

Phe Ser Val Lys Thr Glu Met Leu Gly Asn Glu Ile Asp Ile Glu Cys
180 185 190

Ile Met Glu Asp Gly Glu Ile Ser Gln Val Leu Pro Gly Asp Asn Lys
195 200 205

Phe Asn Ile Thr Cys Ser Gly Tyr Glu Ser His Val Pro Ser Gly Gly
210 215 220

Ile Leu Thr Ser Thr Ser Pro Val Ala Thr Pro Ile Pro Gly Thr Gly
225 230 235 240

Tyr Ala Tyr Ser Leu Arg Leu Thr Pro Arg Pro Val Ser Arg Phe Leu
245 250 255

Gly Asn Asn Ser Ile Leu Tyr Val Phe Tyr Ser Gly Asn Gly Pro Lys
260 265 270

Ala Ser Gly Gly Asp Tyr Cys Ile Gln Ser Asn Ile Val Phe Ser Asp
275 280 285

Glu Ile Pro Ala Ser Gln Asp Met Pro Thr Asn Thr Thr Asp Ile Thr
290 295 300

Tyr Val Gly Asp Asn Ala Thr Tyr Ser Val Pro Met Val Thr Ser Glu
305 310 315 320

Asp Ala Asn Ser Pro Asn Val Thr Val Thr Ala Phe Trp Ala Trp Pro
325 330 335

Asn Asn Thr Glu Thr Asp Phe Lys Cys Lys Trp Thr Leu Thr Ser Gly
340 345 350

Thr Pro Ser Gly Cys Glu Asn Ile Ser Gly Ala Phe Ala Ser Asn Arg
355 360 365

Thr Phe Asp Ile Thr Val Ser Gly Leu Gly Thr Ala Pro Lys Thr Leu
370 375 380

Ile Ile Thr Arg Thr Ala Thr Asn Ala Thr Thr Thr Thr His Lys Val
385 390 395 400

Ile Phe Ser Lys Ala Pro Glu Ser Thr Thr Thr Ser Pro Thr Leu Asn
405 410 415

Thr Thr Gly Phe Ala Asp Pro Asn Thr Thr Thr Gly Val Asp Phe Leu
420 425 430

Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe Trp Leu Ala Ala Ile
435 440 445

His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg Ile Tyr Asp Leu Ile
450 455 460

Glu Leu Val Gln Pro Pro Gln Leu Thr Leu Gln Val Gly Leu Cys Thr
465 470 475 480

Leu Val Ala Met Leu Arg Leu Arg Ala Glu Ala Gln Val Lys Ile Glu
485 490 495

Asp Pro Pro Phe Asn Ser Leu Tyr Leu Leu Glu Met Leu Trp Arg Leu
500 505 510

Gly Gln Gly Gly Ser Pro Thr Ala Met Ala Val Leu Leu His Glu Glu
515 520 525

Ser Met Ile Ala Leu Tyr Leu Gln Gln Asn Trp Trp Thr Leu Arg Ala
530 535 540

Lys Phe Lys Gln Leu Leu Ser Ser Cys Ser Ser Cys Pro Leu Ser Lys
545 550 555 560

Ile Thr Tyr Gly Pro Val Phe Met Cys Leu Gln Ala Lys Trp Arg Leu
565 570 575

Gln Thr Leu Arg Pro Pro Ile Phe Ile Arg Arg Leu Val Ser Phe Ile
580 585 590

Glu Phe Val Gly Trp Tyr Pro Leu His Glu Gln His Gly Met Val Glu
595 600 605

Ile Thr Pro Tyr Lys Pro Thr Trp Cys Leu Gly Gly Leu Leu Thr Met
610 615 620

Val Glu Glu Asn Leu Leu Asp Phe Val Arg Phe Thr Tyr Ser Ala Gly
625 630 635 640

Ile Val Gln Ile Leu Leu Asp Phe Val Arg Phe Met Gly Val Glu Gly
645 650 655

Gly Val Gly Trp Arg His Trp His His His His His His
660 665 

<210> 97 
<211> 669 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 97 

Met Glu Asp Pro Gly Phe Phe Asn Val Glu Ile Pro Glu Phe Pro Phe
1 5 10 15

Tyr Pro Thr Cys Asn Val Cys Thr Ala Asp Val Asn Val Thr Ile Asn
20 25 30

Phe Asp Val Gly Gly Lys Lys His Gln Leu Asp Leu Asp Phe Gly Gln
35 40 45

Leu Thr Pro His Thr Lys Ala Val Tyr Gln Pro Arg Gly Ala Phe Gly
50 55 60

Gly Ser Glu Asn Ala Thr Asn Leu Phe Leu Leu Glu Leu Leu Gly Ala
65 70 75 80

Gly Glu Leu Ala Leu Thr Met Arg Ser Lys Lys Leu Pro Ile Asn Val
85 90 95

Thr Thr Gly Glu Glu Gln Gln Val Ser Leu Glu Ser Val Asp Val Tyr
100 105 110

Phe Gln Asp Val Phe Gly Thr Met Trp Cys His His Ala Glu Met Gln
115 120 125 

Asn Pro Val Tyr Leu Ile Pro Glu Thr Val Pro Tyr Ile Lys Trp Asp
130 135 140

Asn Cys Asn Ser Thr Asn Ile Thr Ala Val Val Arg Ala Gln Gly Leu
145 150 155 160

Asp Val Thr Leu Pro Leu Ser Leu Pro Thr Ser Ala Gln Asp Ser Asn
165 170 175

Phe Ser Val Lys Thr Glu Met Leu Gly Asn Glu Ile Asp Ile Glu Cys
180 185 190

Ile Met Glu Asp Gly Glu Ile Ser Gln Val Leu Pro Gly Asp Asn Lys
195 200 205

Phe Asn Ile Thr Cys Ser Gly Tyr Glu Ser His Val Pro Ser Gly Gly
210 215 220

Ile Leu Thr Ser Thr Ser Pro Val Ala Thr Pro Ile Pro Gly Thr Gly
225 230 235 240



Tyr Ala Tyr Ser Leu Arg Leu Thr Pro Arg Pro Val Ser Arg Phe Leu 
245 250 255 

Gly Asn Asn Ser Ile Leu Tyr Val Phe Tyr Ser Gly Asn Gly Pro Lys
260 265 270 

Ala Ser Gly Gly Asp Tyr Cys Ile Gln Ser Asn Ile Val Phe Ser Asp
275 280 285 

Glu Ile Pro Ala Ser Gln Asp Met Pro Thr Asn Thr Thr Asp Ile Thr
290 295 300 

Tyr Val Gly Asp Asn Ala Thr Tyr Ser Val Pro Met Val Thr Ser Glu 
305 310 315 320 

Asp Ala Asn Ser Pro Asn Val Thr Val Thr Ala Phe Trp Ala Trp Pro 
325 330 335 

Asn Asn Thr Glu Thr Asp Phe Lys Cys Lys Trp Thr Leu Thr Ser Gly
340 345 350 

Thr Pro Ser Gly Cys Glu Asn Ile Ser Gly Ala Phe Ala Ser Asn Arg 
355 360 365 

Thr Phe Asp Ile Thr Val Ser Gly Leu Gly Thr Ala Pro Lys Thr Leu 
370 375 380 

Ile Ile Thr Arg Thr Ala Thr Asn Ala Thr Thr Thr Thr His Lys Val
385 390 395 400 

Ile Phe Ser Lys Ala Pro Glu Ser Thr Thr Thr Ser Pro Thr Leu Asn 
405 410 415 

Thr Thr Gly Phe Ala Asp Pro Asn Thr Thr Thr Gly Val Asp Phe Leu 
420 425 430 

Arg Gly Arg Ala Tyr Gly Leu Pro Tyr Leu Phe Trp Leu Ala Ala Ile 
435 440 445 

His Arg Cys Gln Ala Ile Arg Lys Lys Arg Arg Ile Tyr Asp Leu Ile
450 455 460

Glu Leu Gly Leu Cys Thr Leu Val Ala Met Leu Arg Leu Arg Ala Glu
465 470 475 480

Ala Gln Val Lys Ile Glu Asp Pro Pro Phe Asn Ser Leu Thr Tyr Ser
485 490 495

Ala Gly Ile Val Gln Ile Leu Leu Asp Phe Val Arg Phe Met Gly Val
500 505 510

Glu Gly Gly Val Gly Trp Arg His Trp Ile Ala Leu Tyr Leu Gln Gln
515 520 525

Asn Trp Trp Thr Leu Arg Ala Lys Phe Lys Gln Leu Leu Ser Ser Cys
530 535 540

Ser Ser Cys Pro Leu Ser Lys Ile Thr Tyr Gly Pro Val Phe Met Cys
545 550 555 560

Leu Gln Ala Lys Trp Arg Leu Gln Thr Leu Arg Pro Pro Ile Phe Ile
565 570 575

Arg Arg Leu Val Ser Phe Ile Glu Phe Val Gly Trp Tyr Pro Leu His
580 585 590

Glu Gln His Gly Met Val Glu Ile Thr Pro Tyr Lys Pro Thr Trp Cys
595 600 605

Leu Gly Gly Leu Leu Thr Met Val Glu Glu Asn Leu Leu Asp Phe Val
610 615 620

Arg Phe Tyr Leu Leu Glu Met Leu Trp Arg Leu Gly Gln Gly Gly Ser
625 630 635 640

Pro Thr Ala Met Ala Val Leu Leu His Glu Glu Ser Met Val Gln Pro
645 650 655

Pro Gln Leu Thr Leu Gln Val His His His His His His
660 665 

<210> 98 
<211> 187 
<212> PRT 

<213> epstein-barr virus (EBV) 
<400> 98 

Ser Ser Cys Ser Ser Cys Pro Leu Ser Lys Ile His Arg Cys Gln Ala
1 5 10 15

Ile Arg Lys Lys Cys Leu Gly Gly Leu Leu Thr Met Val Leu Thr Ala
20 25 30

Gly Phe Leu Ile Phe Leu Arg Leu Arg Ala Glu Ala Gln Val Lys Ile
35 40 45

Glu Asp Pro Pro Phe Asn Ser Leu Leu Leu Ser Ala Trp Ile Leu Thr
50 55 60

Ala Arg Arg Arg Trp Arg Arg Leu Thr Val Pro Tyr Leu Phe Trp Leu
65 70 75 80

Ala Ala Ile Tyr Leu Leu Glu Met Leu Trp Arg Leu Gly Gln Gly Gly
85 90 95

Ser Pro Thr Ala Met Val Met Ser Asn Thr Leu Leu Ser Ala Trp Ala
100 105 110

Leu Leu Val Leu Tyr Ser Phe Ala Arg Ala Lys Phe Lys Gln Leu Leu
115 120 125

Ile Ala Leu Tyr Leu Gln Gln Asn Trp Thr Tyr Gly Pro Val Phe Met
130 135 140

Cys Leu Gln Ala Lys Trp Arg Leu Gln Thr Leu Tyr Leu Gln Gln Asn
145 150 155 160

Trp Trp Thr Leu Tyr Pro Leu His Glu Gln His Gly Met Cys Pro Leu
165 170 175

Ser Lys Ile Leu Leu His His His His His His
180 185 

<210> 99 
<211> 246 
<212> PRT 

<213> Human immunodeficiency virus
<400> 99 

Ile Pro Ile Val Ala Ile Val Ala Leu Val Arg Leu Arg Pro Gly Gly
1 5 10 15

Lys Lys Lys Ile Leu Lys Glu Pro Val His Gly Val Pro Leu Val Lys
20 25 30

Leu Trp Tyr Gln Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys
35 40 45

Tyr Lys Leu Lys His Ile Val Trp Thr Trp Glu Thr Trp Trp Thr Glu
50 55 60

Tyr Trp Glu Ile Lys Asp Thr Lys Glu Ala Leu Lys Arg Trp Ile Ile
65 70 75 80

Leu Gly Leu Asn Lys Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Lys
85 90 95

Ile Glu Glu Leu Arg Gln His Leu Met Thr Asn Asn Pro Pro Ile Pro
100 105 110

Val Val Thr Leu Trp Gln Arg Pro Leu Val Trp Ala Ser Arg Glu Leu
115 120 125

Glu Arg Phe Leu Leu Trp Lys Gly Glu Gly Ala Val Tyr Thr Ala Phe
130 135 140

Thr Ile Pro Ser Ile Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys Ser
145 150 155 160

Leu Tyr Asn Thr Val Ala Thr Leu Ala Ile Ile Arg Ile Leu Gln Gln
165 170 175

Leu Ala Ile Phe Gln Ser Ser Met Thr Lys Val Ile Tyr Gln Tyr Met
180 185 190

Asp Asp Leu Leu Val Gly Pro Thr Pro Val Asn Ile Thr Pro Gln Asp
195 200 205

Leu Asn Thr Met Leu Tyr Leu Ala Trp Val Pro Ala His Lys Ala Leu
210 215 220

Val Glu Ile Cys Thr Glu Met Thr Leu Asn Ala Trp Val Lys Val Val
225 230 235 240

His His His His His His 



<210> 100 

<211> 246

<212> PRT 

<213> Human immunodeficiency virus 

<400> 100 

Ile Leu Lys Glu Pro Val His Gly Val Lys Arg Trp Ile Ile Leu Gly
1 5 10 15

Leu Asn Lys Thr Leu Asn Ala Trp Val Lys Val Val Lys Ile Glu Glu
20 25 30

Leu Arg Gln His Leu Tyr Thr Ala Phe Thr Ile Pro Ser Ile Trp Ala
35 40 45

Ser Arg Glu Leu Glu Arg Phe Pro Leu Val Lys Leu Trp Tyr Gln Leu
50 55 60

Leu Val Gly Pro Thr Pro Val Asn Ile Ile Pro Ile Val Ala Ile Val
65 70 75 80

Ala Leu Val Ser Leu Tyr Asn Thr Val Ala Thr Leu Met Thr Asn Asn
85 90 95

Pro Pro Ile Pro Val Leu Leu Trp Lys Gly Glu Gly Ala Val Thr Trp
100 105 110

Glu Thr Trp Trp Thr Glu Tyr Trp Ala Ile Phe Gln Ser Ser Met Thr
115 120 125

Lys Arg Leu Arg Pro Gly Gly Lys Lys Lys Ala Leu Val Glu Ile Cys
130 135 140

Thr Glu Met Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Tyr Leu Ala
145 150 155 160

Trp Val Pro Ala His Lys Ala Ile Ile Arg Ile Leu Gln Gln Leu Glu
165 170 175

Ile Lys Asp Thr Lys Glu Ala Leu Val Thr Leu Trp Gln Arg Pro Leu
180 185 190

Val Ile Tyr Gln Glu Pro Phe Lys Asn Leu Lys Val Ile Tyr Gln Tyr
195 200 205

Met Asp Asp Leu Thr Pro Gln Asp Leu Asn Thr Met Leu Arg Pro Gly
210 215 220

Gly Lys Lys Lys Tyr Lys Leu Lys Tyr Lys Leu Lys His Ile Val Trp
225 230 235 240

His His His His His His 
245 

<210> 101 
<211> 245 
<212> PRT 

<213> Hepatitis C virus 
<400> 101 

Leu Leu Phe Asn Ile Leu Gly Gly Trp Val Lys Thr Ser Glu Arg Ser
1 5 10 15

Gln Pro Arg Phe Leu Leu Leu Ala Asp Ala Arg Val Leu Leu Phe Leu
20 25 30

Leu Leu Ala Asp Ala Arg Leu Gly Val Arg Ala Thr Arg Lys Gly Val
35 40 45

Ala Gly Ala Leu Val Ala Phe Lys Leu Pro Gly Cys Ser Phe Ser Ile
50 55 60

Phe Arg Met Tyr Val Gly Gly Val Glu His Arg Val Ala Gly Ala Leu
65 70 75 80

Val Ala Phe Lys Asp Leu Met Gly Tyr Ile Pro Leu Val Leu Ile Phe
85 90 95

Cys His Ser Lys Lys Lys Ile Leu Ala Gly Tyr Gly Ala Gly Val His
100 105 110

Met Trp Asn Phe Ile Ser Gly Ile Gln Leu Phe Thr Phe Ser Pro Arg
115 120 125

Arg Val Gly Ile Tyr Leu Leu Pro Asn Arg Phe Trp Ala Lys His Met
130 135 140

Trp Asn Phe Tyr Leu Val Thr Arg His Ala Asp Val Leu Ser Ala Phe
145 150 155 160

Ser Leu His Ser Tyr Trp Met Asn Arg Leu Ile Ala Phe Ala Tyr Leu
165 170 175

Leu Pro Arg Arg Gly Pro Arg Leu Tyr Leu Val Ala Tyr Gln Ala Thr
180 185 190

Val Arg Leu Ile Val Phe Pro Asp Leu Gly Val Thr Leu Gly Phe Gly
195 200 205

Ala Tyr Met Ser Lys Ile Pro Phe Tyr Gly Lys Ala Ile Val Leu Val
210 215 220

Gly Gly Val Leu Ala Ala Cys Thr Cys Gly Ser Ser Asp Leu Tyr His
225 230 235 240 

His His His His His 
245 

<210> 102 
<211> 245 
<212> PRT 

<213> Hepatitis C virus 
<400> 102 

Tyr Leu Val Thr Arg His Ala Asp Val Trp Met Asn Arg Leu Ile Ala
1 5 10 15

Phe Ala Cys Thr Cys Gly Ser Ser Asp Leu Tyr Val Gly Ile Tyr Leu
20 25 30

Leu Pro Asn Arg Ile Leu Ala Gly Tyr Gly Ala Gly Val Leu Ile Phe
35 40 45

Cys His Ser Lys Lys Lys His Met Trp Asn Phe Ile Ser Gly Ile Phe
50 55 60

Leu Leu Leu Ala Asp Ala Arg Val Leu Leu Phe Leu Leu Leu Ala Asp
65 70 75 80

Ala Leu Ser Ala Phe Ser Leu His Ser Tyr Gly Val Ala Gly Ala Leu
85 90 95

Val Ala Phe Lys Lys Thr Ser Glu Arg Ser Gln Pro Arg Arg Leu Gly
100 105 110

Val Arg Ala Thr Arg Lys Asp Leu Met Gly Tyr Ile Pro Leu Val Val
115 120 125

Leu Val Gly Gly Val Leu Ala Ala Leu Leu Phe Asn Ile Leu Gly Gly
130 135 140

Trp Val Tyr Leu Val Ala Tyr Gln Ala Thr Val Tyr Leu Leu Pro Arg
145 150 155 160

Arg Gly Pro Arg Leu Phe Trp Ala Lys His Met Trp Asn Phe Gln Leu
165 170 175

Phe Thr Phe Ser Pro Arg Arg Leu Pro Gly Cys Ser Phe Ser Ile Phe
180 185 190

Ile Pro Phe Tyr Gly Lys Ala Ile Arg Met Tyr Val Gly Gly Val Glu
195 200 205

His Arg Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Arg Leu Ile Val
210 215 220 

Phe Pro Asp Leu Gly Val Val Ala Gly Ala Leu Val Ala Phe Lys His 
225 230 235 240 

His His His His His 
245 
Figure 3A gp350/PT26A (SEQ ID NO: 96)

[Figure: annotated amino acid sequence of gp350/PT26A in one-letter code, numbered every 70 residues up to 420, with two T-help epitopes marked and residue 428 marked. HLA restrictions labelled beneath the CTL epitopes, in order of appearance: B8, A23, B27, B27, B46, A2, A3, B60, A2, B62, B35, B57/A2, B8, A11, A24, B8, B7, B58, B35, B44, A2, B44, A24, A2, B44.]



Figure 3B gp350/PT26B (SEQ ID NO: 97)

[Figure: annotated amino acid sequence of gp350/PT26B in one-letter code, numbered every 70 residues up to 420, with two T-help epitopes marked and residue 428 marked. HLA restrictions labelled beneath the CTL epitopes, in order of appearance: B8, A23, B27, B27, A2, A3, B60, A24, A2, B44, B57/A2, B8, A11, A24, B8, B7, B58, B35, B44, A2, B44, A2, B62, B35, B46.]
Figure 4A [lane labels: - IPTG, + IPTG]

[Figure panels (figure number not recoverable): "gp350PT26A-ISCOM vaccine (sub. cut)"; "gp350PT26B-ISCOM vaccine (sub. cut)"; "Positive control - Peptide mix"; axis ticks 50 and 100.]

Figure 9







